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METHOD OF PREDICTING THE ABILITY OF COMPOUNDS 
TO MODULATE THE BIOLOGICAL ACTIVITY OF RECEPTORS 

This application is a continuation-in-part of 
PCT/US99/06664, filed March 26, 1999, which is a 
5 continuation-in-part of 60/115,345, filed January 8, 1999, 
which is a continuation-in-part of Paige, et al . , Serial No. 
60/099,656, filed September 9, 1998, which is a 
continuation-in-part of Paige, et al . , Serial No. 60/082, 
756, filed April 23, 1998. All of the above applications 
10 are hereby incorporated-by-ref erence . 

Cross-Ref erence to Related Applications 

Thorp, Serial No. 08/904,842, METHOD OF IDENTIFYING AND 
DEVELOPING DRUG LEADS WHICH MODULATE THE ACTIVITY OF A 
TARGET PROTEIN, discloses several methods of identifying 

15 drug leads. In essence a protein of interest, in one or 

more states, is characterized by (a) its chemical reactivity 
with one or more characterizing reagents, and/or (b) its 
binding to one or more aptamers (especially nucleic acids) , 
generating an array of descriptors by which it may be 

20 characterized as more or less similar for reference proteins 
for which an equivalent array of descriptors have been 
generated, and for which one or more activity-mediating 
reference drugs are known. Suitable drug leads for the 
protein of interest are those analogous to the reference 

25 drugs for the more similar reference proteins. 

Fowlkes, et al . PCT/US97/19638 , 08/740,671, 09/050,359 
and 09/069,827, IDENTIFICATION OF DRUGS USING COMPLEMENTARY 
COMBINATORIAL LIBRARIES, disclose the use of a first 
combinatorial library, e.^., of peptides, to obtain a set of 

3 0 binding peptides that can serve as a surrogate for the 
natural ligand of a target protein. A small organic 
compound library (preferably combinatorial in nature) is 
then screened for compounds which inhibit the binding of the 
surrogates to the target protein. 

35 All of the aboye applications are hereby incorporated 

by reference. 



0 • 



Mention of Government Support 

Some of the work disclosed herein was funded by the 
U.S. government through NIH Grant DK 48 807 to Donald P. 
McDonnell. The U.S. Government may have certain rights in 
5 the invention. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention relates to a method of identifying drugs 
which can mediate the biological activity of a target 
10 protein. It also relates to reagents, especially peptides, 
useful in that method, or more directly in mediating the 
biological activity of said target protein or in binding to 
said target protein themselves . 

Description of the Background Art 

15 Protein Binding a.nd Biological Activity 

Many of the biological activities of the proteins are 
attributable to their ability to bind specifically to one or 
more binding partners (ligands) , which may themselves be 
proteins, or other biomolecules . 

20 When the binding partner of a protein is known, it is 

relatively straightforward to study how the interaction of 
the binding protein and its binding partner affects 
biological activity. Moreover, one may screen compounds for 
the ability of the compound to competitively inhibit the 

25 formation of the complex, or to dissociate an already formed 
complex. Such inhibitors are likely to affect the 
biological activity of the protein, at least if they can be 
delivered in vivo to the site of the interaction. 

If the binding protein is a receptor, and the binding 

30 partner an effector of the biological activity, then the 

inhibitor will antagonize the biological activity. If the 
binding partner is one which, through binding, blocks a 
biological activity, then an inhibitor of that interaction 
will, in effect, be an agonist. 
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Screening for Modulators of Receptor Activity 

The current state of the art for screening for 
modulators of receptor activity involves the displacement of 
a labeled ligand from the ligand binding pocket of the 
5 receptor. For example, a screen may be for displacement of 
radiolabeled estradiol from the estrogen receptor. This 
assay only provides information concerning the relative 
affinities of the compounds for the receptor and gives no 
indication of the activity of the compound on the receptor, 

10 that is whether it functions as an agonist or an antagonist 
of receptor activity. This is a major problem for 
pharmaceutical companies to overcome in screening for 
modulators of receptor activity. 

The assays that have been developed to date that can 

15 distinguish between agonists and antagonists involve cell- 
based assays and reporter gene systems. McDonnell, et al . , 
Molec. Endocrinol., 9:659 (1995). In these systems, the 
receptor and a reporter gene are co- transf ected into cells 
in culture. The reporter gene is only activated in the 

2 0 presence of active receptor. The ability of a compound to 

modulate receptor activity is determined by the relative 
strength of the reporter gene activity. These assays are 
time consuming and can produce variable results in different 
cell lines or with different reporter genes or response 
25 elements. Thus, the data must be interpreted with caution. 

Methods have been developed that also take advantage of 
the different conformational states of receptors. 
Proteolytic digestion of the estrogen receptor in the 
presence of an agonist or antagonist produces distinct 

3 0 banding patterns on a denaturing polyacrylamide gel. In 

certain conformations, the receptor is protected from 
digestion at a particular site, while a different 
conformation may expose that site. Thus the banding 
patterns may indicate whether the receptor was complexed 
35 with an agonist or antagonist at the time of proteolytic 

digestion. This method requires copious amounts of receptor 
protein and is time consuming and expensive in that it 
requires a gel to be run for each sample. It is not 
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suitable for screening numerous samples . 

The following are examples of patents on cell based 
screening methods : 

Patent #5723291 - Methods for screening compounds for 
5 estrogenic activity- 
Patent #5298429 - Bioassay for identifying ligands for 
steroid hormone receptors 

Patent #5445941 - Method for screening anti- 
osteoporosis agents 
10 Patent #5071773 - Hormone receptor-related bioassays 

Patent #5217867 - Receptors: their identification, 
characterization, preparation and use 

Nuclear Receptors 

Nuclear receptors are a family of ligand activated 

15 transcriptional activators, see Evans and Hollenberg, Cell, 
52:1-3 (1988), factors which include the receptors for 
steroid and thyroid hormones, retinoids, and vitamin D. The 
steroid receptor family is composed of receptors for 
glucocorticoids, mineralocorticoids , androgens, progestins, 

20 and estrogens. These receptors are organized into distinct 
domains for ligand binding, dimerization, transact ivation, 
and DNA binding. Receptor activation occurs upon ligand 
binding, which induces conformational changes allowing 
receptor dimerization and binding of co-activating proteins. 

25 These co-activators, in turn, facilitate the binding of the 
receptors to DNA and subsequent transcriptional activation 
of target genes. In addition to the recruitment of co- 
activating proteins, the binding of ligand is also believed 
to place the receptor in a conformation that either 

3 0 displaces or prevents the binding of proteins that serve as 
co-repressors of receptor function. Lavinsky, et al . , Proc . 
Nat. Acad. Sci . (USA), 95:2920 (1998). 

The estrogen receptor is a member of the steroid family 
of nuclear receptors. Human ER<y is a 595 amino acid protein 

35 composed of six functional domains or regions (A-F) . The 

A/B region contains the transcription function AF-1, and the 
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E domain contains the transcription function AF-2 . These 
functions activate transcription in a cell- and promoter 
context-specific manner. AF-1 is constitutively active, 
while AF-2 is induced by hormone binding to the receptor. 
5 The C region contains the DNA-binding domain and a 

dimerization domain. The DNA-binding domain binds the 
estrogen (receptor) response element (ERE) associated with a 
regulated gene. The DBD contains two zinc fingers. The C 
region may also be responsible for nuclear localization. 

10 The E region contains the hormone (ligand) binding domain. 
The classical ERE is composed of two inverted 
hexanucleotide repeats, and ligand-bound ER binds to the ERE 
as a homodimer. The ER also mediates gene transcription 
from an API enhancer element that requires ligand and the 

15 API transcriptional factors Fos and Jun for transcriptional 
activation. Tamoxifen inhibits transcription of genes 
regulated by a classical ERE, but activates transcription of 
genes under the control of an API element. See Paech, et 
al.. Science, 277:1508-11 (1997). 

2 0 In the absence of hormone, the estrogen receptor 

resides in the nucleus of target cells where it is 
associated with an inhibitory heat shock protein complex. 

(Smith, et al . , (1993) Mol . Endocrinol., 7:4-11.) Upon 
binding ligand, the receptor is activated. This process 
25 permits the formation of stable receptor dimers and 

subsequent interaction with specific DNA response elements 
located within the regulatory region of target genes. 

(McDonnell, et al . (1991), Mol. Cell Biol., 11:4350-4355.) 
The DNA bound receptor can then either positively or 

3 0 negatively regulate target gene transcription. Although the 

precise mechanism by which the ER modulates RNA polymerase 
activity remains to be determined, it has been shown 
recently that agonist bound ER can recruit transcriptional 
adaptors, proteins that permit the receptor to transmit its 
35 regulatory information to the cellular transcriptional 

apparatus. (Onate, et al . (1995), Science, 270:1354-1357; 
Norris, et al . (1998), J, Biol. Chem. , 273:6679-6688; Smith, 
et al . (1997), Mol. Endocrinol., 11:657-666). Conversely, 
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when occupied by antagonists, the DNA bound receptor 
actively recruits co-repressors , proteins that permit the 
cell to distinguish between agonists and antagonists. 
(Norris, et al . (1998); Smith, et al . (1997); Lavinsky, et 
5 al., (1998) Proc. Natl. Acad. Sci . USA, 95:2920-2925). 

Building on this complexity was the recent discovery of a 
second estrogen receptor, ER(3 , whose mechanism of action 
appears to be similar, yet distinct from ERa . (Greene, et 
al . (1986), Science, 231:1150-1154; Kuiper, et al . (1996), 

10 Proc. Natl. Acad. Sci. USA, 93:5925-5930; Mosselman, et al . 
(1996), FEES Lett., 392:49-53). 

Thus, there are two forms of this receptor, a and (3, 
presently known; other forms may exist. Both receptors 
activate transcription in response to estrogens, which are 

15 an important group of steroid hormones that not only 

influence the growth, differentiation, and functioning of 
the reproductive system, but also exert effects in the bone, 
brain and cardiovascular system. Estrogens can produce a 
broad range of effects in this diverse set of target 

20 tissues. These differential effects are believed to be 

mediated, in part, by tissue specific activation of the two 
different transact ivation domains present at the amino- 
terminal and carboxy- terminal regions of the receptor. It 
is also likely that the two forms of the receptor (of and jS) 

25 function in distinct tissues and thereby mediate the 

transact ivation of different subsets of genes. (Paech, et 
al . , Science, 277:1508, 1997; Kuiper and Gustafsson, FEES 
Lett., 410:87, 1997; Nichols, et al., EMEO J., 17:765, 1998; 
Montano, et al . , Mol . Endo . , 9:814, 1995.) 

3 0 Drugs that target the estrogen receptor can exhibit a 

variety of effects in different target tissues. For 
example, tamoxifen is an ER antagonist in breast tissue, 
(Jordan, V.C., (1992) Cancer, 70:977-982), but an ER agonist 
in bone (Love, et al . (1992), New Engl. J. Med., 326:852- 

35 856) and uterine, (Kedar, et al . (1994), Lancet, 343:1318- 

1321) tissue. Raloxifene is also an ER antagonist in breast 
tissue; however, it exerts agonist activity in bone but not 
uterine tissue (Black, et al . (1994), J. Clin. Invest., 
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93:63-69), Indeed, one of the greatest challenges in 
understanding the pharmacology of the estrogen receptor is 
determining how different ER ligands produce such diverse 
biological effects . 
5 Estrogens, in general, are stimulatory agents, 

resulting in increased gene expression and cell 
proliferation in target tissues. However, many molecules 
have been .described that bind to the estradiol binding site 
on the receptor, but produce negative effects on gene 

10 expression and cell growth. These agents have historically 
been termed "antiestrogens " , but this term has proven to be 
much too simplistic. (Tremblay, et al . , Can. Res., 58:877, 
1988; Katzenellenboge , et al . , Breast Can Res. Treatm. , 
44:23, 1997; Howell, Oncology (suppl. 1), 11:59, 1997; Gallo 

15 and Kaufman, Sem. in Oncol. (Suppl. 1), 24:71, 1997). One 
of the most noteworthy of these agents is tamoxifen, which 
has been successfully used in the treatment of ER-positive 
breast cancer. Tamoxifen, a derivative of 

triphenylethylene , is metabolized in the cell to produce 4- 

20 OH tamoxifen, which has very high affinity for the estradiol 
binding pocket of the ER . Although this compound competes 
with estradiol for binding to the ER, it does not induce 
transcriptional activation in breast tissue, thus it does 
not promote cell growth and acts as a classic antiestrogen 

25 in this tissue. Tamoxifen, however, does have estrogen-like 
activities in other tissues. In the uterus, tamoxifen acts 
as an agonist of receptor activity, stimulating the growth 
of uterine tissue leading to an increased incidence of 
endometrial hyperplasia in treated patients. Tamoxifen also 

3 0 produces estrogenic effects in the bone and cardiovascular 
system. This activity generates beneficial effects such as 
reducing the risk of osteoporosis and lowering serum LDIj 
levels. The numerous differential effects produced by 

compounds such as tamoxifen has led to the replacement of 

35 the term "antiestrogen" with "selective estrogen receptor 
modulators" or SERMs . SERMs may have both positive and 
negative effects on ER activity depending on the biology of 
receptor and the tissue in which it is being expressed. 
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A goal of current research is to develop SERMs that 
have agonistic or estrogenic effects on bone and the 
cardiovascular system and antagonistic or antiestrogenic 
effects in the breast and uterus. One SERM that has 
5 recently been approved for treatment of post -menopausal 
symptoms is Raloxifene. Raloxifene is a benzothiophene 
derivative that, like tamoxifen, binds in the ligand binding 
pocket of the ER. Clinical studies indicate that this 
compound lacks estrogenic activity in the breast and uterus, 

10 but produces estrogenic activity in the bone and perhaps the 
cardiovascular system. It is currently prescribed for 
prevention for osteoporosis in post -menopausal women. There 
are several additional SERMs in clinical trials, and a great 
deal of effort in the pharmaceutical industry is focused on 

15 the identification and characterization of additional SERMs. 

The search for SERMs poses a major obstacle. In order 
to screen large libraries of compounds for SERMs, it is 
necessary to have a convenient assay for identifying which 
lead molecules have the desired effect (s). Currently, when 

20 a compound is identified that competes with estradiol for 
binding to the ER, a number of cell -based assays must be 
conducted to determine its activity. These studies are more 
laborious than in vitro assays and still do not absolutely 
predict the complete spectrum of biological activity of the 

25 SERM. Thus, studies often have to move into animal models 
or clinical trials before the selective modes of action of 
the SERM can be determined. A simple in vitro system to 
distinguish between agonist and antagonist activity of a 
SERM would be of great utility. 

30 The development of such a system requires knowledge of 

the mechanisms that produce the broad effects of SERMs. 
There is evidence that SERMs are able to produce 
differential (agonistic and antagonistic) effects due to 
their ability to alter the conformation of the ER. In 

35 general, the receptor is thought of as having two 

conformations, active or inactive. These conformations are 
formed in the presence or absence of ligand, respectively. 
The SERM drives the receptor into a conformation that is 
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neither fully active nor fully inactive. This intermediate 
conformation creates changes in the association patterns of 
co-activators, co-repressors , and other regulatory molecules 
with the receptor, thus producing variable effects. The 
5 broad range of effects produced by SERMs may also be due to 
selective tissue expression of ER alpha and beta as well as 
co-activators and co-repressors . It may also be due to 
different affinities of the SERM for the two receptors. 

Traditional Drug- Screening 

10 In traditional drug screening, natural products 

(especially those used in folk remedies) were tested for 
biological activity. The active ingredients of these 
products were purified and characterized, and then synthetic 
analogues of these "drug leads" were designed, prepared and 

15 tested for activity. The best of these analogues became the 
next generation of "drug leads", and new analogs were made 
and evaluated. 

Both natural products and synthetic compounds could be 
tested for just a single activity, or tested exhaustively 

20 for any biological activity of the interest to the tester. 
Testing was originally carried out in animals, later, less 
expensive and more convenient model systems, employing 
isolated organs, tissue, or cells, or cell cultures, 
membrane extracts or purified receptors, were developed for 

25 some pharmacological evaluations. 

Testing in whole animals and isolated organs typically 
requires large amounts of chemical compound to test. Since 
the quantity of a given compound within a collection of 
potential medicinal compounds is limited, this requires one 

3 0 to limit the number of screens executed. 

Also, it is inherently difficult to establish 
structure/activity relationships (SAR) among compounds 
tested using whole animals, or isolated organs or tissues 
or, to a lesser extent, cultured cells. This is because the 

35 actual molecular target of any given compound's action may 
be quite different from that of other compounds scoring 
positive in the assay. By testing a battery of compounds on 
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a very specific target, one can correlate the action of 
various chemical residues with the quantitative activity and 
use that information to focus ones search for active 
compounds among certain classes of compounds or even direct 
5 the synthesis of novel compounds having a composite of the 
properties shared by the active compounds tested. 

Another disadvantage to whole animal, organ, tissue and 
cell based screening is that certain limitations may prevent 
an active compound from being scored as such. For instance, 

10 an inability to pass through the cellular membrane may 
prevent a potent inhibitor, within a tested compound 
library, from acting on the activated oncogene ras and 
giving a spurious negative score in a cell proliferation 
assay. However, if it were possible to test ras in an 

15 isolated system, that potent inhibitor would be scored as a 
positive compound and contribute to the establishment of a 
relevant SAR. Subsequent, chemical modifications could then 
be carried out to optimize the compound structure for 
membrane permeability. (In the case of cell -based assays, 

20 this problem can be alleviated to some degree by altering 
membrane permeability.) 

Drug Discovery. The human genomics effort could yield 
gene sequences that code for as many as 70,000 proteins, 
each a potential drug target; microbial genomics will 

25 increase this number further. Unfortunately, since genomic 
studies identify genes, but not the biological activity of 
the corresponding proteins, it is likely that many of the 
genes will prove to encode proteins whose activation or 
inactivation has no effect on disease progression. (Gold, 

30 et al., J. Nature Biotech., 15:297, 1997). There is 

therefore a need for a method of determining which proteins 
are most likely to be productive targets for pharmacological 
intervention . 

Even if one knew in advance the perhaps 10,000 proteins 
35 which could be considered interesting targets, there remains 
the problem of efficiently screening hundreds of thousands 
of possible drugs for a useful activity against these 10,000 
targets. 
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Historically, acquiring chemical compound libraries has 
been a barrier to the entry of smaller firms into the drug 
discovery arena. Due to the large quantity of chemical 
required for testing on whole animals and even on cells in 
5 culture, it was a given that whenever a compound was 

synthesized it should be done in fairly large quantity. 
Thus, there was a synthesis and purification throughput of 
less than 50 compounds per chemist per year. Large 
companies maintained their immensely valuable collections as 

10 trade barriers. However, with the downsizing of targets to 
the molecular level and the automation of screens, the 
quantity of a given compound necessary for an assay has been 
reduced to very small amounts. These changes have opened 
the door for the utilization of so-called combinatorial 

15 chemistry libraries in lieu of the traditional chemical 

libraries. Combinatorial chemistry permits the rapid and 
relatively inexpensive synthesis of large numbers of 
compounds in the small quantities suitable for automated 
assays directed at molecular targets. Numerous small 

2 0 companies and academic laboratories have successfully 

engineered combinatorial chemical libraries with a 
significant range of diversity (reviewed in Doyle, 1995, 
Gordon et ai , 1994a, Gordon et al, 1994b) . 

Combinatorial Libraries . In a combinatorial library, 
25 chemical building blocks are randomly combined into a large 
number (as high as 10E15) of different compounds, which are 
then simultaneously screened for binding (or other) activity 
against one or more targets. 

Libraries of thousands, even millions, of random 

3 0 oligopeptides have been prepared by chemical synthesis 

(Houghten et al . , Nature, 354:84-6(1991)), or gene 
expression (Marks et al., J Mol Biol, 222:581-97(1991)), 
displayed on chromatographic supports (Lam et al . , Nature, 
354:82-4(1991)), inside bacterial cells (Colas et al . , 
35 Nature, 380:548-550(1996)), on bacterial pili (Lu, 

Bio/Technology, 13:366-372(1990)), or phage (Smith, Science, 
228:1315-7(1985)), and screened for binding to a variety of 
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targets including antibodies (Valadon et al . , J Mol Biol, 
261:11-22(1996)), cellular proteins (Schmitz et al . , J Mol 
Biol, 260:664-677(1996)), viral proteins (Hong and 
Boulanger, Embo J, 14:4 714-4727(1995)), bacterial proteins 
5 (Jacobsson and Frykberg, Biotechniques , 18:878-885(1995)), 
nucleic acids (Cheng et al . , Gene, 171:1-8(1996)), and 
plastic (Siani et al . , J Chem Inf Comput Sci, 34:588- 
593 (1994) ) . 

Libraries of proteins (Ladner, USP 4,664,989), peptoids 

10 (Simon et al . , Proc Natl Acad Sci USA, 89:9367-71(1992)), 
nucleic acids (Ellington and Szostak, Nature, 
246:818(1990)), carbohydrates, and small organic molecules 
(Eichler et al . , Med Res Rev, 15:481-96(1995)) have also 
been prepared or suggested for drug screening purposes. 

15 The first combinatorial libraries were composed of 

peptides or proteins,, in which all or selected amino acid 
positions were randomized. Peptides and proteins can exhibit 
high and specific binding activity, and can act as 
catalysts. In consequence, they are of great importance in 

20 biological systems. Unfortunately, peptides per se have 

limited utility for use as therapeutic entities. They are 
costly to synthesize, unstable in the presence of proteases 
and in general do not transit cellular membranes. Other 
classes of compounds have better properties for drug 

25 candidates. 

Nucleic acids have also been used in combinatorial 
libraries. Their great advantage is the ease with which a 
nucleic acid with appropriate binding activity can be 
amplified. As a result, combinatorial libraries composed of 

3 0 nucleic acids can be of low redundancy and hence, of high 

diversity. However, the resulting oligonucleotides are not 
suitable as drugs for several reasons. First, the 
oligonucleotides have high molecular weights and cannot be 
synthesized conveniently in large quantities. Second, 

3 5 because oligonucleotides are polyanions, they do not cross 
cell membranes. Finally, deoxy- and ribo-nucleot ides are 
hydrolytically digested by nucleases that occur in all 
living systems and are therefore usually decomposed before 
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reaching the target . 

There has therefore been much interest in 
combinatorial libraries based on small molecules, which are 
more suited to pharmaceutical use, especially those which, 
5 like benzodiazepines, belong to a chemical class which has 
already yielded useful pharmacological agents. The 
techniques of combinatorial chemistry have been recognized 
as the most efficient means for finding small molecules that 
act on these targets. At present, small molecule 

10 combinatorial chemistry involves the synthesis of either 

pooled or discrete molecules that present varying arrays of 
functionality on a common scaffold. These compounds are 
grouped in libraries that are then screened against the 
target of interest either for binding or for inhibition of 

15 biological activity. Libraries containing hundreds of 

thousands of compounds are now being routinely synthesized; 
however, screening these large libraries for binding or 
inhibition with all 10,000 potential targets cannot be 
reasonably accomplished with present screening technologies, 

2 0 and there are numerous experimental and computational 
strategies under development to reduce the number of 
compounds that must be screened for each target . 
Information- intensive drug discovery. As pointed out by 
Paterson, et al . , J. Med. Chem. , 39: 3049-59 (1996), 

25 medicinal chemistry advances through the dual processes of 
"lead discovery" and "lead optimization". In "lead 
discovery", the search objective is the discovery of an 
"activity island", a chemical class with a high frequency 
of active molecules. (this class may be defined 

30 mathematically as a volume within a multidimensional space 
defined by various molecular descriptors). In "lead 
optimization", the "activity island" is explored in detail. 
If each compound synthesized and tested can be considered as 
a probe of a "neighborhood" of similar compounds, in "lead 

35 discovery", it is inefficient to test substances whose 
neighborhoods overlap . 

Coupled to the recent advancements in genomics and 
molecular biology has been a revolution in information 
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technology, which includes relational databases, computer 
graphics, and neural networks (13) . These capabilities 
permit the construction of databases of descriptors that 
describe either compounds or targets in quantitative terms, 
5 and these descriptors can be related to make predictions 
about the structures of compounds, their biological 
activities, and the targets they act on (5-8) . 

Structure descriptors can be based on a variety of 
structural features. These approaches provide arrays of 

10 molecular descriptors that can be used to assess the 
similarity of molecules in a library. 

See Patterson, et al . , et al . , J. Med. Chem. , 39: 
3049-59 (1996), Klebe and Abraham, J. Med. Chem., 36:70-80 
(1993), Cummins, et al . , J. Chem. Inf. Comput . Sci . , 36:750- 

15 63 (1996), Matter, J. Med. Chem., 40:1219-29 (1997); 
Weinstein, et al . , Science, 275:343-9 (1997). 

For proteins, structural descriptors cannot be directly 
calculated from the amino acid sequence. 

Compounds may be characterized by their activity rather 

20 than by structure. Kauvar, et al . , Chemistry & Biology, 2: 
107-118 (1995) "fingerprinted" over 5,000 compounds by the 
binding potency (concentration needed to inhibit 50% of the 
protein's activity) of each compound to each member of a 
reference panel of eight proteins. (These proteins were 

25 selected on the basis of readily assayable activity, broad 
cross-reactivity with small organic molecules, and low 
correlation between each other in binding patterns.) A 
screening library of 54 compounds was then selected based on 
the diversity in their "fingerprints" (inhibitory activity 

30 against the reference panel proteins) . 

This "training set" was used to evaluate the similarity 
of the ligand binding characteristics of a new protein to 
one of the reference panel proteins. By regression 
analysis, a computational surrogate (a weighted sum of two 

35 or more reference panel proteins) for the new protein is 

determined. The activity of all fingerprinted compounds to 
inhibit the activity of the new protein is predicted as the 
sum of their appropriately weighted inhibitory activities 
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against the component reference proteins of the 
computational surrogate. Predictions may be improved by- 
testing additional sets of compounds against the new 
protein. See also L. M. Kauvar, H. O. Villar. Method to 
5 identify binding partners. US Patent 5587293. 

Weinstein, supra, in a study of the molecular 
pharmacology of cancer, took a similar approach. The 
"activity" database (A) contains the activities against 60 
cell lines for 60,000 compounds that have been screened at 

10 NCI. The similarity in the activity profile against the 
panel of cell lines can then be calculated for any two 
. compounds, and is generally assessed by a pairwise 
correlation coefficient (PCC) , which is determined by an 
algorithm called COMPARE, which calculates the similarity of 

15 all of the compounds in the database to a user-supplied 
"seed" compound. 

Hicrh- Throughput Screening 

A high-throughput screening system usually comprises 

(1) suitably arrayed compound libraries, (2) an assay method 
20 configured for automation, (3) a robotics workstation for 

performing the method, and (4) a computerized system for 

handling the data. 

The array may be a standard 96 -well microtitre plate, 

or an array of compounds on chips, beads, agar plates or 
25 other solid support . The array may be a simplex array of 

individual compounds or a complex array in which each 

element is a predetermined mixture of a small number, e.g., 

10-20, different compounds. In the latter case, the mixture 

ultimately must be deconvolved to identify the true active 
3 0 component (s) . 

For ease of automation, the assay should require as few 

steps as possible. Thus, homogeneous assays, which do not 

require fractionations, or more than a single addition of 

reagent, are desirable. 
35 See generally Broach and Thorner, Nature, 384, 14 (Nov. 

7, 1996); Milligan and Rees, Trends Pharmacol. Sci., 20:118- 

24 (1999) . 
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Preferred reporter genes for high- throughput screening 
include bacterial beta-galactosidase , lucif erase, human 
placental alkaline phosphatase, bacterial beta- lactamase , 
and jellyfish green fluorescent protein. 
5 Gonzalez and Negulescu, Curr. Op. Biotechnol . , 9:624-31 

(1998) , discuss intracellular detection assays suitable for 
high- throughput screening. Such assays are conveniently 
provided as optical assays, which may rely on absorbance, 
fluorescence, or luminescence as readouts. While absorbance 
10 assays ave been useful in melanophore and beta-galactosidase 
reporter assays for GPCRs, such assays have relatively low 
sensitivity. To achieve significant absorbance changes, 
very high concentrations of dyes and many cells are 
necessary. Hence, the absorbance assays do not lend 
15 themselves as well to miniaturized formats. 

In contrast, luminescence and fluorescence are more 
sensitive and high S/N ratios are commonplace. 

With regard to chemiluminescence assays, the standard 
substrates are luciferin and aequorin. Since high 

2 0 concentrations of luciferin and ATP are desirable to drive 

lucif erase-catalyzed reactions, the lucif erase assay is 
usually conducted in cell lysates from thousands of cells, 
rather than in intact cells. Membrane- impermable 
luminescent substrates have been used in connection with 
25 extracellular or lysate assays. The greatest advantage of 
chemiluminescence assays is their extremely low background. 

Fluorescence can easily be detected at the single cell 
level. However, the process of exciting fluorescence is not 
absolutely selective; there is a background of unwanted 

3 0 fluorescence and light scattering from endogenous cellular 

and equipment sources . 

Cell-based fluorescence assays fall into three broad 
categories: (1) those based on changes in fluorescence 
intensity, such as those based on the calcium-sensitive 
35 Fluo-3 sensor; (2) those based on energy transfer, such as 
FRET (where there is an energy transfer from a donor 
fluorophore to an acceptor fluorophore when they are in 
close proximity and have a spectral overlap); and (3) those 
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based on energy redistribution (where a tagged molecule 
moves within a cell, and the change in position of the 
fluorescence within the individual cell is observed) . 
The possible signals include Ca, cAMP, voltage, 
5 enzymatic, protein interaction, and transcription. Ca and 
cAMP are both mentioned in the context of GPCR targets. For 
Ca, the suggested readout is Ca indicator dye 
(fluorescence) , Ca photoprotein (luminescence) , a reporter 
gene (fluorescence or luminescence) , and cameleon (FRET) . 

10 For cAMP, the suggested readouts are FlchR (FRET) and a 
reporter gene (fluorescence or luminescence) . 

The authors also comment that other detection methods, 
such as fluorescent polarization, fluorescence correlation 
spectroscopy, and time-resolved detection, which are still 

15 primarily used in biochemical or binding assays, will also 
undoubtedly migrate into cell based assays. 

All references , including any patents or patent 
applications , cited in this specification are hereby 
incorporated by reference . No admission is made that any 

20 reference constitutes prior art. The discussion of the 

references states what their authors assert and applicants 
reserve the right to challenge the accuracy and pertinency 
of the cited document. 
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SUMMARY OF THE INVENTION 

The present specification describes several different, 
although related, inventions: 

Firstly, it relates to a method for the more efficient 
5 identification of small organic molecules, preferably 
molecules having a molecular weight of less than 500 
daltons, which are pharmaceutically acceptable and potent 
modulators of the biological activity of a protein. 

This method provides a simple and consistent means for 

10 identifying and characterizing modulators of receptor 

activity, using 'BioKey' oligomers (especially peptides) to 
probe receptor conformation. It can be used as a tool in 
both primary and secondary screens for compounds that 
modulate the activity of a receptor. In some embodiments, 

15 the method is also completely in vitro so the activity -of a 
compound can be assessed without using a cell based assay, 
let alone a whole animal assay. 

The identification of drugs which modulate the nuclear 
receptors such as the estrogen receptor, and membrane 

20 receptors belonging to the family of G protein coupled 
receptors, are of particular interest. 

Secondly, it relates to a new strategy for the 
identification of modulators of G-protein coupled receptors. 
In this strategy, instead of searching for oligomers which 

25 bind the receptor's ligand binding site, we screen for those 
which inhibit the interaction of the G-alpha subunit . These 
oligomers may be used in the method of the first invention 
above, but may also be used in any screening, diagnostic, or 
therapeutic method in which an agent which modulates a G 

30 protein coupled receptor is useful. 

Thirdly, it relates to various peptides (Example 202) 
which bind G-alpha in either its GDP (D peptides) or GTP . (T 
peptides) bound conformation, as well as to peptides which 
bind G-alpha without regard to its activation state (I 

35 peptides) . These peptides, which are the product of the 

second invention noted above, may be used as conformational 
probes in the practice of the first invention noted above, 
when that invention is applied to the task of identifying 
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modulators of G-protein coupled receptors. Or they may be 
used for any other purpose for which an oligomer of such 
binding characteristics is useful. 

It is of particular interest to use the G-alpha binding 
5 peptides in fluorescent or luminescent assays for agonists 
and antagonists of GPCRs, as these assays lend themselves to 
use in a high- throughput screening assay context. 
Fluorescence assays of the scintillation proximity, 
fluorescence polarization, or FRET formats are of particular 

10 interest (Example 203) . 

Fourthly, it relates to a "one-hybrid" assay (Example 
205) for agonists or antagonists of GPCRs. In this assay, a 
fusion of a T peptide and a membrane -active signalling 
protein is used to detect the activation of the receptor. 

15 Such activation converts the associated G-alpha to the GTP 

bound state, allowing the T peptide to bind. This causes the 
signalling protein to be recruited to the membrane, where it 
becomes active, generating a signal. 

Fifthly, it relates to a "two-hybrid assay" system in 

20 which a G-alpha and a G-alpha binding, activation state- 
specific peptide participate. 

In one form of this assay, the system is used to detect 
agonists or antagonists of the GPCR. For this purpose, the 
G alpha is fused to component A of a signal generating 

25 system, and the G alpha binding peptide to component B. 

The binding of an agonist to the GPCR activates the GPCR, 
causing the G alpha hybrid to bind GTP, whereupon the first 
hybrid is bound by a T-peptide hybrid, thereby bringing A 
and B into proximity, generating a signal. One could 

30 similarly use the D peptide in an assay for antagonist 
activity. Or use both, with different readout systems. 

Another form of this assay is used to detect 
interactions between two particular proteins X and Y. 

Sixthly, it relates to the use of Go? chimeras, with an 

35 N- terminal constant region bound by a set of well 

characterized peptide biokeys, and a variable C- terminal 
region allowing coupling to different receptors, in screens 
for agonists or antagonists of GPCRs normally coupled to a 
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G-alpha other than the one which is the source of the N- 
terminal constant region. 

Finally, the present invention relates to various 
peptides which bind the estrogen receptor in ligand-specif ic 
5 conformations. These peptides may be used as conformational 
probes in the practice of the first invention noted above 
when that invention is applied to the task of identifying 
modulators of the estrogen receptor. Or they may be used 
for any other purpose for which an oligomer of such binding 
10 characteristics is useful. These peptides may be used in a 
manner analogous to the use of the G-alpha binding peptides 
mentioned above, or for other therapeutic and diagnostic 
purposes . 

15 Peptide Binding as a Barometer of Protein Conformation; Use 
of Peptide Panels in the Fingerprinting of Compounds as 
Agonists or Antagonists 

We have explored the possibility that various ER 
ligands induce distinct conformational changes in the ER. 

20 These distinct conformations may, in turn, alter the 

interactions of the receptor with cell and tissue specific 
co-activating or co-repressing proteins or even estrogen 
response elements, thus leading to diverse biological 
effects. Using limited proteolysis, we and others have 

25 shown that the ER agonist estradiol and the ER antagonist 

Imperial Chemical Industries (ICI) 182,780 induced distinct 
ER conformations (McDonnell, et al . (1995), Mol . 
Endocrinol., 9:659-669; Beekman, et al . (1993), Mol. 
Endocrin., 7:1266-1274). However, the picture is much more 

30 complicated than this. There are a variety of ER ligands, 
namely, selective estrogen receptor modulators (SERMS) , 
which are neither pure agonists nor antagonists. These 
ligands, which include tamoxifen and raloxifene, produce 
distinct tissue specific biological effects, yet 

35 conformational differences cannot be discerned in the 

protease digestion assay. It is likely that these compounds 
are also eliciting distinct conformational changes that 
affect ER activity, but the changes are too subtle to be 



21 

detected by the protease digestion assay (Brzozowski , et al . 
(1997), Nature, 389:753-758; Shiau, et al. (1998), Cell, 
95 : 927-937) . 

This invention, in one of its aspects, is based on the 
5 observation that peptides isolated by screening a phage- 
displayed peptide library for binding to the estrogen 
receptor had dramatically different binding affinities 
depending on whether the receptor was unliganded, or 
complexed with an agonist or an antagonist. Thus, the 
10 peptide binding appears to be a barometer of protein 

conformation, and hence of whether a compound which is 
complexed to the receptor is acting as an agonist or an 
antagonist. 

In essence, a panel of "BioKeys" (typically peptides) 

15 which alter the conformation of a receptor in distinctly 

different ways, are used to obtain a "fingerprint" of how a 
compound of interest interact with that receptor in its 
various BioKey-modif ied conformations, each element of the 
fingerprint being a measure of the strength of interaction 

2 0 of the compound with the receptor in the presence of a given 
BioKey. Once fingerprints are obtained for a reasonable 
number of reference compounds with known biological 
activities, as measured by a "gold standard" (whole animal, 
or isolated organ or tissue) assay, the similarity of the 

25 fingerprint of a new compound to that of the reference 
compounds may be calculated, and used to predict the 
bioactivity of the new compound. 

The invention has advantages over the whole animal - 
based systems described above in that 1) the same technology 

30 can be applied to a variety of different receptors, 2) the 
system can be used for high throughput screening and 
compound characterization, and 3) the system gives very 
distinct patterns for agonists and antagonists of receptor 
activity using very little protein. 

35 In the "molecular braille" (MB) embodiment of the 

present invention, the reference and test fingerprints are 
based on in vitro (cell-free) assays. 

In the "cellular-braille" (CB) embodiment of the 
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present invention, the reference and test fingerprints are 
based on cellular assays (but not on assays of whole 
multicellular organisms, or their organs or tissues) . 
The advantages of "molecular braille" are 
5 • gives information about affinity, and, based on a 

fingerprint, bioactivity in a single assay 

# can be faster and less expensive if the protein is 
a) inexpensive to purchase or b) easy to express 
and purify 

10 ® gives information about structure-activity 

relationships 
€> peptide/receptor interactions may be more 

sensitive because there will not be anything 
extraneous to get in the way 

15 Its disadvantages are 

% protein may not be properly folded, modified, or 
be in the presence of cofactors it needs to be 
active 

# doesn't give much of the information given by CB 

20 In contrast, the advantages of "cellular braille" are 

€) If in yeast it can be cheaper than MB 
^ Bioactivity (including dose: effect) information 
@ gives closer indication of how a whole animal 
might respond 

25 # you may get active metabolites 

® no need for protein purification 

Its disadvantages are 

® compounds that cannot get into the cell will 
automatically be selected against 
3 0 • does not give affinity information directly 

® throughput likely to be lower than with MB, 

although still better than whole animal assay. 
Both "molecular braille" and "cellular braille" are 
faster and less expensive than whole animal bioassays, and 
3 5 more readily automated for high throughput, and their use as 
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preliminary screens helps minimize experimentation on 
animals, which itself is an ethical goal of society. 

It will be appreciated that both techniques may be 
used, either sequentially or simultaneously. For example, 
5 MB may be used as a first screen and CB as a second screen 
of the first round positives. Or compounds may be screened 
by both MB and CB, and compounds earmarked by either screen 
given further attention. Similarities may be calculated 
separately from in vitro and cell-based assays, or the 

10 results of these two types of assays may be combined into a 
single fingerprint for each reference or test compound. 

In a preferred embodiment, this method uses phage 
display to isolate peptides (BioKeys) that map the sites of 
biological interaction on both the active and the inactive 

15 receptor. These BioKeys are probes for alterations in 

receptor conformation, and can readily distinguish between 
active, inactive and partially active receptor. The 
patterns of binding obtained with the peptides provides a 
fingerprint of the receptor conformation. The binding of 

20 the individual peptides will increase or decrease in the 
presence of an agonist or an antagonist of receptor 
activity. Such activity may or may not be tissue-specific. 
In some cases, whether a molecule is an agonist or an 
antagonist will depend on the tissue in question {e.g. for 

25 SERMs) , or on other environmental factors. Therefore, the 

peptides may be used to classify compounds, not only as pure 
agonists or antagonists, but also more complexly. The 
method has the following applications: 

1) One or more of these peptides can be used in a 
30 competitive displacement assay to identify modulators of 

receptor activity in a high -throughput (in vitro or simple 
cell) screen. 

2) The peptides can be used to fingerprint 
modulators of receptor activity and classify them as 

35 agonists or antagonists of receptor activity. 

3) Peptides identified for orphan receptors may 
be used to identify the natural ligand of these receptors. 

4) This method may be used for nuclear receptors 
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as well as other receptors such as G-protein coupled 
receptors . 

5) Method can be applied to any protein that 
undergoes a conformational change upon ligand/substrate 
5 binding . 

In a particular preferred embodiment, the invention is 
used to predict SERM activity against nuclear receptors, 
such as the estrogen receptor. 

In order to characterize SERM activity at the estrogen 

10 receptor, we have developed a system that utilizes peptides 
to mimic the binding of various ER associated proteins to ER 
Of and P in an in vitro setting. The peptides bind 
preferentially to either the active or inactive conformation 
of the receptor, and will distinguish between different 

15 conformational changes in the ER that result from the 
binding of a SERM. The system will also allow the 
comparison of effects of the SERM on ER a. and j8 . This assay 
provides a simple procedure to determine the relative 
agonist/antagonist activity of a newly identified SERM. The 

20 technology may also be applied to the analysis of selective 
modulators of any receptor. 

We have developed an in vitro system for identifying, 
characterizing and classifying modulators of receptor 
activity. The technique was developed using the estrogen 

25 receptor and is based on mapping sites of biological 

interaction on the active and inactive receptor using phage 
displayed peptide libraries. The peptides that bind to these 
sites appear to mimic proteins that bind preferentially to 
the active or inactive estrogen receptor. Certain sites on 

3 0 the receptor are only available for binding when an agonist 
is bound to the ER. Other sites are more readily available 
for binding with a SERM complexed ER. The relative binding 
affinities of these peptides on an estrogen complexed 
receptor, or a SERM complexed receptor relative to an 

35 unliganded receptor provides a fingerprint that is 

indicative of the agonist /antagonist activity of the SERM. 
The system has been tested on the ER using several known 
agonists and SERMs . Agonists of receptor function and SERMs 
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produced distinct fingerprints in our system indicative of 
their distinct in vivo functions. This system may be used 
as a primary screening tool to identify hits, to classify 
lead compounds from a drug screen, to characterize SERMs in 
5 terms of agonist and antagonist function and to predict 
possible clinical effects of SERMs such as tissue and 
receptor specificity. This method can also be applied to 
the fractionation of mixtures of SERMs to determine which 
components are producing agonistic and antagonistic 

10 activity. This method may also be used with other receptors 
(e.g., progesterone, androgen, glucocorticoid, thyroid, 
vitamin D, beta-adrenergic, dopamine, epidermal growth 
factor, etc.), to identify, characterize and classify 
modulators of receptor activity. 

15 While peptides have been identified for use as probes 

to modify receptor conformation, to help screen compound 
libraries, certain of these peptides may be useful in their 
• own right as drugs or diagnostics. 

In addition, nonpeptide mimetics or other analogues of 

2 0 the aforementioned peptides may be useful as drugs or 

diagnostics . 

The screened compounds, and their analogues, are also 
of interest . 

Identification of Agonists and Antagonists of GPCRs by Means 
25 of BioKeys vs. Go? Subunits 

This invention also relates to the isolation of BioKeys 
to the Go; subunits of the protein, especially the subunit 
Gia f and their use in screening to identify agonists and 

antagonists for G protein-coupled receptors (GPCR's). 

3 0 Ga is one subunit of a heterotrimeric GTP binding 

protein (G protein) . G proteins are known to bind GTP or 
alternatively GDP at the same binding site. The GTP bound 
form constitutes the active form while the GDP bound form 
constitutes the inactive form of the Go; subunit. Upon 
35 activation of a GPCR by ligand, one of the ways that signals 
are transduced is by altering GPCR's cognate G protein from 
a GDP bound state to a GTP bound state. This allows 
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separation of the alpha subunit from the other two subunits 
that make up the G protein, beta and gamma. Consequently 
the signal is transduced down various cell signaling 
pathways due to the subunit 's interactions with other 
5 proteins. GPCR's are relevant to many diseases, mutations 
have been found that relate to an array of health problems 
including cancer, infertility, asthma, hypertension, and 
endocrine disorders. Gi is one of several G proteins, and 
Giof is its Gof subunit. The G proteins of greatest interest 

10 are Gs, Gi , Gq and G12/13 . 

The BioKeys that have been identified for Gia can be 
categorized into four groups. BioKeys that are specific to 
the GTP bound form of GQf(T peptides), those that are 
specific to the GDP bound form of Ga {B peptides), those that 

15 bind independently of GTP or GDP (I peptides), and those that 
have a weak specificity to either form(BT peptides) . 

G-alpha binding peptides of particular interest are 
those which are not subsequences, or even merely 
conservative substitution mutants of subsequences, which are 

20 found in naturally occurring peptide agonists of antagonists 
of G-alpha, such as melittin, substance P, GP Antagonist -2A, 
MAS 7 and Mastoparan, or in G-beta, G-gamma, or GPCR. 

Prior to the present invention, it was not known or 
suggested that G-alpha would be bound by different peptides 

2 5 depending on whether it was GTP or GDP bound. 

These novel peptides may be used in a variety of high 
throughput screening assays that take advantage of their 
specificity for different forms of the Gof protein to isolate 
novel agonists and antagonists to GPCR's for use as 

30 therapeutics. These include both in vitro assays such as 
SPA assays, FRET assays, FP assays, and TRF assays and in 
vivo assays such as cell based FRET, reconstruction of 
enzyme activity assays, creation of enzyme activity assays, 
and modified two hybrid like assay. Although we have 

35 initially screened GiQf,one may similarly isolate BioKeys to 
all the Gof's and use the resulting BioKeys to screen known 
and orphan GPCR's. 

The resulting BioKeys may, but need not, be used in the 
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aforementioned method for fingerprinting a compound as an 
agonist or antagonist of the GPCR. 
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Use of activation- sensitive G-a.lpha binding peptides in 
monitoring the activation state of a GPCR within a cell 

An alternative use for the aforementioned activation- 



sensitive G-alpha binding peptides is in monitoring the 
activation state of a GPCR within a cell. An I -peptide is 
fused to a first fluorophore and a peptide of a different 
category (D or T) is fused to a different fluorophore, the 

10 two fluorophores being matched for FRET. Both peptides will 
bind to the G-alpha, bringing the fluorophores into 
interactive proximity, only if the G-alpha is inactive (in 
the case of the D embodiment) or active (in the case of the 
T embodiment) . Activation will occur as the result of 

15 agonist action on the cognate GPCR to which the G-alpha in 
question is functionally coupled. Inactivation will 
likewise occur as the result of antagonist action. 

Alternatively, instead of fusing an I -peptide to the 
first fluorophore, one could couple the G-alpha subunit 

20 itself to it. The D or T peptide would bind to the G-alpha 
under the appropriate condition, leading to fluorophore 
energy transfer . 

Use of activation- sensitive G-alpha binding peptides in 
detecting protein-protein interactions . Activation 

2 5 sensitive (D- or T-) G alpha binding peptides can be used in 

assays for protein-protein interactions , 

In this assay, one hybrid comprises a G-alpha and 
protein Y , and the other hybrid comprises a G-alpha binding 
peptide and protein Y. The interaction of the two proteins 
30 X and Y, should it occur, brings the G-alpha and its binding 
peptide into contact . This system of course also includes a 
compatible GPCR and an agonist (if the peptide is a T- 
peptide) or an antagonist (if the peptide is a D-peptide) so 
as to assure that the G-alpha is in the proper conformation. 

3 5 The difference between these two forms is that, in the 

first form, it is not known whether a GPCR agonist or 
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antagonist in present, and the conjugated proteins are known 
to interact; while in the second form, the GPCR agonist or 
antagonist is provided, but it is not known whether the two 
conjugated proteins interact. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Ficrure 1 shows the effect of seven drugs which modulate 
estrogen receptor activity, in five sites of action. 

Figure 2 maps the binding site for interaction of the 
5 four peptides with ERo? (or moieties thereof) as influenced 
by estradiol or 4 -OH tamoxifen. 

Figure 3 shows that different ligands induce different 
structural alterations in ER alpha and ER beta, as shown by 
differences in the binding of 11 different peptides. (The 
10 data in this figure is tabulated in table 14.) 

Figure 4 compares the effects of estradiol (an agonist) 
and raloxifene (an antagonist) on ERalpha conformation. 
f=l Figure 5 shows how the data in Fig. 3 and Table 14A is 

^.3 used to calculate similarities. 

J" 15 Figure 5A duplicates Table 14A. 

'^3 Figure 5B shows the raw Euclidean distances. 

Figure 5C shows the calculated similarities, after 
scaling: 

^, maximum dist - actual dist 

12 20 similarity= 

^ max i mum dist 

=S With 8 descriptors (BioKeys) and scores 0-7, the maximum 

=3 distance is SQRT (7*7*8), or 19.79899. 

''"^ Figure 5D is a 3D bar graph corresponding to 5C . 

25 Figure 5E is a 2D bar graph isolating the similarity 

data for estradiol and 4 -OH tamoxifen. 

Figure 6 shows, in a similar manner, the calculation of 
similarities based on the ERbeta data. 

Figure 7 analyzes the interaction of seven drugs with 
30 ERof and four different peptides (ABl, A2 , AB3 , AB5) using a 
mammalian two-hybrid assay system. 

Figure 8 analyzes the specificity of interaction of 
various drugs with four more nuclear receptors and the same 
four peptides using the same assay system. 
3 5 Figure 9 explores the interaction of the four peptides 

with mutant receptors (impaired AF-2 function) as influenced 
by seven different drugs. 

Figure 10 studies the disruption of ER mediated 
transcriptional activity as a function of peptide 
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concentration. 

Figure 11 shows that the A2 disruption of tamoxifen- 
activated ER is not promoter - dependent . 

Figure 12 explores disruption of ER transcriptional 
5 activity as mediated by the AP-1 pathway. 

Figure 13 is a schematic model of potential mechanisms 
of action of peptides which block tamoxifen partial agonist 
activity . 

Figure 14 shows the normalized luciferase activity of a 
10 two-hybrid mammalian system for ER AF2 in presence of 

estradiol (E2) , 4-OH tamoxifen, ICI, DES, GW 7604, estrone, 
equilin and D8,9DHE. 

Figure 15 shows the binding of various peptides to both 
wild- type and mutant ER. 
15 Figure 16 A shows the disruption of Eof transcriptional 

activation function in mammalian cells as a result of the 
action of LXXLL- containing peptides. B. shows the 
synergistic interaction of two copies of LXXLL motif 
function to compete with endogenous coact ivators . 
20 Figure 17 shows that LXXLL containing peptides disrupt 

AF2 functions in HepG2 cells. 

Figure 18 shows that nuclear receptors have distinct 
preferences for different peptides with LXXLL motifs. 

Figure 19 shows that peptide" 293 selectively disrupts 
25 Erb dependent reporter gene expression without affecting Era 
dependent transcription . 

Figure 2 0 shows a similarity analysis of the data 
pictured in Figure 7 . 

Figure 21 examines interactions among the androgen 
30 receptor, the peptide D30 (Table 101, Class I) , and various 
drugs, by a mammalian two -hybrid assay. The maximum 
distance possible is SQRT (7397 * 7397 * 5) =16, 540.19. 
Similarity was calculated as (maxdist-liganddist ) /maxdist . 
The "73 97" value is the the third item in the Nafox column. 
35 Five is the number of BioKeys in the panel. 
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DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENTS OF THE INVENTION 

Receptor -Mediated Pharmacolocrical Activity 
Many pharmacologically active substances elicit a 
5 specific physiological response by interacting with an 
element, known as a receptor , of the target cell. A 
receptor is a component, usually macromolecular , of an 
organism with which a chemical agent interacts in some 
specific fusion to case an action which leads to an 
10 observable biological effect. For purposes of the present 
invention, antibodies are not considered receptors. 

Hormones, growth factors, neurotransmitters and many 
other biomolecules normally act through interaction with 
specific cellular receptors. Drugs may activate or block 
15 particular receptors to achieve a desired pharmaceutical 

effect. Cell surface receptors mediate the transduction of 
an "external" signal (the binding of. a ligand to the 
receptor) into an "internal" signal (the modulation of a 
pathway in the cytoplasm or nucleus involved in the growth,. 
20 metabolism or apotosis of the cell) . 

In many cases, transduction is accomplished by the 
following signaling cascade: 

© An agonist (the ligand) binds to a specific protein 
(the receptor) on the cell surface. 
25 ® As a result of the ligand binding, the receptor 
undergoes an allosteric change which activates a 
transducing protein in the cell membrane. 

© The transducing protein activates, within the cell, 
production of so-called "second messenger molecules." 

30 o The second messenger molecules activate certain 

regulatory proteins within the cell that have the potential 
to "switch on" or "off" specific genes or alter some 
metabolic process . 

This series of events is coupled in a specific fashion 

35 for each possible cellular response. The response to a 
specific ligand may depend upon which receptor a cell 
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expresses. For instance, the response to adrenalin in cells 
expressing of-adrenergic receptors may be the opposite of the 
response in cells expressing jS-adrenergic receptors. 

The above "cascade" is idealized, and variations on 
5 this theme occur. For example, a receptor may act as its 
own transducing protein, or a transducing protein may act 
directly on an intracellular target without mediation by a 
"second messenger". 

The substances which are able to elicit the response, 

10 by specific interaction with a receptor site, are known as 
agonists. Typically, increasing the concentration of the 
agonist at the receptor site leads to an increasingly larger 
response, until a maximum response is achieved. A substance 
able to elicit the maximum response is known as a full 

15 agonist, and one which elicits only, at most, a lesser (but 
discernible) response is a partial agonist. 

A pharmacological antagonist is a compound which 
interacts with the receptor without eliciting a response, 
and by doing so inhibits the receptor from responding to 

20 agonists. A competitive antagonist is one whose effect can 
be overcome by increasing the agonist concentration; a 
noncompetitive antagonist is one whose action is unaffected 
by agonist concentration. A sequestering antagonist is one 
which inhibits a ligand: receptor interaction by binding to 

2 5 the ligand in such a way that it can no longer bind the 

receptor. A competitive sequestering antagonist competes 
with the receptor for the ligand, whereas a competitive 
pharmacological antagonist competes with the ligand for the 
receptor . 

3 0 Ligands are substances which bind to receptors, and 

thereby encompass both agonists and pharmacological 
antagonists. However, ligands exist which bind receptors, 
but which neither agonize nor antagonize the receptor . 
Ligands which activate (agonize) or inhibit (antagonize) the 
35 receptor are here collectively termed modulators . Some 

modulators change roles, acting as agonists or antagonists, 
depending on circumstances. 

Natural ligands are those which, in nature, without 
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human intervention, are responsible for agonizing or 
antagonizing a natural receptor. A natural ligand may be 
produced by the organism to which the receptor is native. A 
ligand native to a pathogen or parasite may bind to a 
5 receptor native to a host. Or a ligand native to a host may 
bind to a receptor native to a pathogen or parasite. All of 
these are natural ligands. 

The clinical concept of drug antagonism is broader than 
the pharmacological concept, including phenomena that do not 

10 involve direct inhibition of agonist : receptor binding. A 
"physiological" antagonist Gould be a substance which 
directly or indirectly inhibits the production, release or 
transport to the receptor site of the natural agonist, or 
directly or indirectly facilitates its elimination (whether 

15 physical, or by modification to an inactive form) from the 
receptor site, or inhibits the production or increases the 
rate of turnover of the receptor, or interferes with signal 
transduction from the activated receptor. 



2 0 estrogen receptor) may be a pharmacological antagonist of 

another, e.g., a transcription factor. A physiological 
antagonist of one receptor may be a pharmacological agonist 
of another receptor, such as one which activates an enzyme 
which degrades the natural ligand of the first receptor. 
25 Similarly, one may speak of a physiological agonist, 

which is a substance which directly or indirectly enhances 
the production, release or transport to the receptor site of 
the natural agonist, or directly or indirectly inhibits its 
elimination from the receptor site, or enhances the 

3 0 production or reduces the rate of turnover of the receptor, 

or in some way facilitates signal transduction from the 
activated receptor. 

It follows that there are both "pharmacological" and 
"physiological" modulators, 
35 A functional antagonist of a receptor is a substance 

which acts on a second receptor triggering a biological 
response which counteracts or inhibits the normal response 
to activation of the first receptor. Thus, a functional 



A physiological antagonist of one receptor (e.g 



an 
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antagonist of one receptor may be a pharmacological agonist 
. of another . 

If a disease state is the result of inappropriate 
activation of a receptor, the disease may be prevented or 
5 treated by means of a physiological or pharmacological 
antagonist . Other disease states may arise through 
inadequate activation of a receptor, in which case the 
disease may be prevented by means of a suitable 
physiological or pharmacological agonist . 
10 - An important class of receptors are proteins embedded 

in the phospholipid bilayer of cell membranes. The binding 
of an agonist to the receptor (typically at an extracellular 
f'^, binding site) can cause an allosteric change at an 

%f intracellular site, altering the receptor's interaction with 

ill 15 other biomolecules . The physiological response is initiated 

^.3 by the interaction with this "second messenger" (the agonist 

f^^; is the "first messenger") or "effector" molecule. 

1=:^ Enzymes are special types of receptors. Receptors 

interact with agonists to form complexes which elicit a 
20 biological response. Ordinary receptors then release the 
agonist intact. With enzymes, the agonists are enzyme 
substrates, and the enzymes catalyze a chemical modification 
%1 of the substrate. Thus, enzyme substrates are "ligands". 

Enzymes are not necessarily integral membrane proteins; they 
25 may be secreted, or intracellular, proteins. Often, enzymes 
are activated by the action of a receptor's second 
messenger, or, more indirectly, by the product of an 
"upstream" enzymatic reaction. 

Thus, drugs may also be useful because of their 
30 interaction with enzymes. The drug may serve as a substrate 
for the enzyme, as a coenzyme, or as an enzyme inhibitor. 
(An irreversible inhibitor is an " inact ivator " . ) Drugs may 
also cause, directly or indirectly, the conversion of a 
proenzyme or apoenzyme into an enzyme. Many disease states 
3 5 are associated with inappropriately low or high activity of 
particular enzymes . 

Both agonists and co-activators bind to a receptor, and 
increase its level of activation (signal transduction; 
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enzymatic activity; etc.). However, an agonist binds to a 
ligand binding site which is exposed even in the absence of 
a co-activator. A co-activator binds a receptor only after 
an agonist binds, the receptor, causing a change in 
5 conformation which opens up the co-activator ' s binding site. 
Agonist binding is coact ivator- independent , although the 
coactivator may be necessary to activate the receptor. A 
co-activator may be facultative or obligatory. A co- 
inhibitor competitively inhibits the binding of a co- 

10 activator to the co-activator binding site. 

The present invention may be used to identify agonists, 
antagonists, and coact ivators and coinhibitors , of 
receptors. It is not unusual for a relatively small 
structural change to convert an agonist into a 

15 pharmacological antagonist, or vice versa . Therefore, even 
if the drugs known to interact with a reference protein are 
all agonists, the drugs in question may serve as leads to 
the identification of both agonists and antagonists of the 
reference protein and of related proteins. Similarly, known 

2 0 antagonists may serve as drug leads, not only to additional 
antagonists, but to agonists as well. 



Potency 

The potency of an antagonist of a receptor may be 
expressed as an IC50, the concentration of the antagonist 
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which causes a 50% inhibition of a receptor's binding or 
biological activity in an in vitro or in vivo assay system. 
A pharmaceutically effective dosage of an antagonist depends 
on both the IC50 of the antagonist, and the effective 
5 concentrations of the receptor and its clinically 
significant binding partner (s). 

Potencies may be categorized as follows: 

Category 1C50 

Very Weak >1 fi moles 

10 Weak 100 n moles to 1 fi mole 

Moderate 10 n moles to 100 n moles 

Strong 1 p mole to 10 n moles 

Very Strong <1 p mole 

Preferably, the antagonists identified by the present 
15 invention are in one of the four higher categories 

identified above, and are in any event more potent than any 
antagonist known for the protein in question at the time of 
filing of this application. 

In a similar manner, the potency of an agonist may be 
20 quantified as the dosage resulting in 50% of its maximal 
effect on a receptor. 

General Method 

In the present invention, the biological activity of a 
test substance, as mediated by a particular receptor, in a 
25 particular organism, and thereof is predicted by: 

(I) providing a panel of "Biokeys", the "Biokeys" 
having a differential ability to bind the receptor in the 
presence or absence of one or more ligands, said panel 
therefore being able to discriminate among two or more 

3 0 different receptor conformations, 

(II) screening a set of two or more reference 
substances, which are known pharmacological agonists or 
antagonists of the receptor in one or more organisms and 
tissues, for the ability to alter the binding of the "Biokeys" 

35 to the receptor, thereby obtaining a reference "fingerprint", 
for each reference substance, which is an array of descriptors, 
each descriptor defining, qualitatively or quantitatively, the 
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effect of the reference compound on the binding of a Biokey 
panel member to the receptor. 

(III) The test compound is similarly screened for 
its ability to alter the binding of the "Biokeys" to the 

5 receptor, thereby obtaining a test fingerprint, 

(IV) the similarity of the test fingerprint to each 
of the reference fingerprints is determined, and 

(V) the biological activity of the test substance 
in one or more target organisms, and in one or more target 

10 tissues thereof, is predicted on the basis of the biological 
activities of the reference substances therein, appropriately 
weighted by the similarity between the test substance and the 
reference substance . 

The Biokey panel of step (I) is preferably obtained by 

15 screening the members of a combinatorial library for the 
ability to bind to (a) the unliganded receptor, and (b) a 
liganded receptor. In one embodiment, a combinatorial library 
is first screened against (a) , and then either the whole 
library, or only the unliganded receptor-binding members, are 

2 0 screened against (b) . In another embodiment, the whole library 

is screened against (a) and (b) simultaneously. It is also 
permissible to screen first against (b) and then against (a) . 

Preferably, the combinatorial library is an amplifiable 
combinatorial library, i.e., a library of nucleic acids or 
25 peptides. The members of the Biokey panel may be individual 
molecules, or mixtures of molecules with similar binding 
characteristics . 

It will be appreciated that step (II) need only be 
performed once for a given receptor and that it is not 

3 0 necessary that all reference substances be fingerprinted 

simultaneously. Also, steps (II) and (III) may be 

interchanged . 

In step (IV) , similarity may be determined in a 
qualitative and subjective way, i.e., by "eyeballing" the 
35 fingerprints and judging from experience which is more similar, 
or in a quantitative and objective manner, using the similarity 
measures set forth infra . 

Similarly, in step (V) , the biological activity may be 
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predicted in a qualitative and subjective way, or more 
quantitatively and objectively, by mathematically weighting 
each reference substance's activity scores by the calculated 
similarity of its fingerprint to the fingerprint of the test 
substance . 

By way of example, peptides (BioKeys) that bind to the ER 
can be classified based on their ability to bind to the ER in 
the presence or absence of ER agonists. The different 

affinities of the peptides are due to alterations in receptor 
conformation following binding of an agonist. Since SERMs also 
uniquely alter receptor conformation, it is likely that they 
can affect the binding of the peptides from the different 
classes as well. Each agonist or SERM has associated 

pharmacological effects. For example, estrogen has stimulatory 
activity in breast and uterus, bone and the cardiovascular 
system. Likewise, tamoxifen is stimulatory in the uterus, bone 
and the cardiovascular system, but it has antagonistic effects 
in the breast. The pattern of BioKey binding to the ER in 
response to each compound could be matched with the 
pharmacological effect of each compound. Additionally, a 
comparison between BioKey fingerprints on ER of and (3 will 
supplement the information on agonist and antagonist activity 
and should be predictive of tissue specificity. New estrogen 
agonists and antagonists could then be screened and classified 
based on their BioKey binding pattern to ER a and jS, and 
compounds with a desired tissue-specific activity could be more 
readily identified . 



Hypothetical Table of a "BioKey Fingerprint" for a Hypothetical 
Nuclear Receptor 

Compound A Compound B Compound C Compound D Compound E 

BioKey 1 + + + + + 

BioKey 2 + + + + 

BioKey 3 4- + + 

BioKey 4 + + 

BioKey 5 + 
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Hypothetical Table of Pharmacological Effects of Receptor Modulating Compounds 

Breast Uterus Bone Cardiovascular 

Compound A + -h + + 

Compound B - + - + 

Compound C - + - - 

Compound D + + - 

Compound E + + - + 



For example, using the above tables, compounds with 
unknown pharmacological effects could be characterized by 

10 ''BioKey fingerprinting" to predict their activity in various 
tissues. A compound X, that had a fingerprint similar to 
compound A, would be predicted to have pharmacological effects 
similar to compound A. The "binding or. lack- of binding of a 
specific BioKey with the receptor could indicate activity in 

15 a specific tissue type. In the above examples, binding of 
BioKey 1 to the receptor in the presence of a compound could 
indicate activity in the uterus. Whereas binding of BioKey 5 
to the receptor in the presence of compound could indicate 
activity in bone. 

2 0 Substances 

A "substance" may be either a pure compound, or a mixture 
of compounds. Preferably it is at least substantially pure, 
that is, sufficiently pure enough to be acceptable for clinical 
use. If it is a mixture, then it comprises at least an 
25 effective amount (i.e., able to give rise to a detectable 
biological response in a biological assay) of a biologically 
active compound, or it comprises a substantial amount of a 
compound which is suspected of being biologically active and 
is suitable as a drug lead if so active. 

3 0 Test substances and Drug Leads 

A test substance comprises an effective amount of a 
compound, which is a member of a structural class which is 
generally suitable, in terms of physical characteristics (e.g. , 
solubility) , as a source of drugs and which is not known to 
35 have the pharmacological activity of interest. A drug lead is 
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a former test substance which has either been predicted to have 
desirable pharmacological activity, or in fact has been shown 
to have such activity, and which therefore could serve 
effectively as a starting point for the design of analogues and 
5 derivatives which are useful as drugs. The "drug lead" may be 
a useful drug in its own right, or it may be a substance which 
is deficient as a drug because of inadequate potency or 
undesirable side effects. In the latter case, analogues and 
derivatives are sought which overcome these deficiencies. In 

10 the former case, one seeks to improve the already useful drug. 

Such analogues and derivatives may be identified by 
rational drug design, or by screening of combinatorial or 
noncombinatorial libraries of analogues and derivatives. 

Preferably, a drug lead is a compound with a molecular 

15 weight of less than 1,000, more preferably, less than 750, 
still more preferably, less than 600, most preferably, less 
than 500. Preferably, it has a computed log octanol -water 
partition coefficient in the range of -4 to +14, more 
preferably, -2 to +7.5. 

2 0 A small organic compound library is a library of compounds 

each of which has a molecular weight of less than 1000, and 
which are not peptides or nucleic acids. 

SERMs from distinct structural classes may produce 
fingerprints unique to its class. In addition SERMs from 
25 different classes that have similar biological activities 
should produce similar fingerprints. Numerous SERMs that have 
been identified can be fingerprinted in our system. These 
include steroidal ant iestrogens such as the ICI compounds 
164,3 84 and 182,780, and non-steroidal compounds such as the 

3 0 benzothiophene derivative Raloxifene, and triphenylethylene 

derivatives Toremifene, Droloxifene, TAT-59 and Idoxifene. We 
have found that the steroidal SERMs will produce fingerprints 
distinct from the non-steroidal SERMs (see Example 2) . 
Steroidal compounds such as the ICI compounds have been 
35 categorized as pure anti-estrogens, in that there is no well 
documented evidence of any estrogenic effects in response to 
these compounds. These fingerprints may be similar to the 
unliganded (inactive) receptor, or they may indicate that a co- 
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repressor is bound more tightly or that a co-activator is 
completely inhibited from binding. 

The fingerprinting system should be useful for identifying 
agonistic and antagonistic components from complex mixtures. 
5 The prescription drug Premarin is used for the treatment of 
post-menopausal symptoms. It is a complex mixture derived from 
the urine of pregnant mares. The active components of this 
mixture are not known. Fractionation of Premarin followed by 
fingerprinting of the individual components would indicate 
10 which of the components play an active role in modulating 
estrogen receptor function. It is also likely that components 
of Premarin interact with other nuclear receptors such as the 
progesterone receptor. The effect of these components could 
^,3 be determined as well. 

^^3 15 Reference Ligands 

A reference ligand is a substance which is a ligand for 
%^ a target receptor. Preferably, it is a pharmacological agonist 

f or antagonist of a target receptor protein in one or more 

target tissues of a target organism. However, a reference 
ry 20 ligand may be useful, even if it is not an agonist or 

antagonist, if it alters the conformation of its receptor, 
.J e.g., such that at least some Biokeys which bound the 

unliganded receptor do not bind as well, or bind better, the 
liganded receptor. Preferably, a reference ligand has a 
25 differential effect on Biokeys, so that Biokeys may be 
differentiated on the basis of their interaction with the 
receptor in the presence of the reference ligand. A reference 
ligand may be an agonist of one receptor and an antagonist of 
another. It may also be agonist of a receptor in one tissue 
3 0 and an antagonist of the same receptor in another tissue, or 
in another organism. 

The reference ligand may be, but need not be, a natural 
ligand of the receptor. 

The reference ligands may, but need not, satisfy some or 
35 all of the desiderata set forth above for test substances and 
drug leads . 

If a test substance from one screening becomes a drug 



42 

lead, and that compound, or an analogue thereof, is ultimately 
found to mediate the biological activity of at least one 
receptor in at least one tissue of at least one organism, it 
may be used as a reference ligand in subsequent screenings of 
5 other test substances, and in redefining the Biokey panel. 

Reference Conformation 

When a target receptor is in an unliganded state, it has 

a particular conformation, i.e., a particular 3-D structure. 

When the receptor is complexed to a ligand, the receptor's 
10 conformation changes. If the ligand is a pharmacological 

agonist, the new conformation is one which interacts with other 

components of a biological signal transduction pathway, e.g.; 

transcription factors, to elicit a biological response in the 

target tissue. If the ligand is a pharmacological antagonist, 
15 the new conformation is one in which the receptor cannot be 

activated by one or more agonists which otherwise could 

activate that receptor. 

Each of the conformations of a target receptor which is 

used as a binding target in a binding array is considered a 
20 reference conformation. 

It may be that two different ligands will coincidentally 

cause a receptor to assume the same conformation. However, for 

the purpose of this invention, those will be considered 

different reference conformations because different ligands are 
2 5 involved. 

Biokeys 

For the purpose of the present invention, Biokeys are 
substances whose ability to bind to a target receptor in the 
presence or absence of one or more reference ligands for that 

30 receptor can be used to differentiate the reference ligands, 
and ultimately to calculate the degree of similarity between 
a test substance (having an assayable effect on the binding 
of the Biokeys to the target receptor protein) and reference 
substances (likewise having an assayable effect as such 

35 binding, but whose effect on biological activity of the 
receptor protein in target organisms and tissues of interest 
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is also known) . 

Preferably, Biokeys are members of a combinatorial 
library, and in particular an amplif iable combinatorial library 
such as a peptide or nucleic acid library. The library may 
5 then be screened for binding to various receptor conformations. 
Biokeys need not themselves be suitable as drug leads. 

Biokev Panel 

For the purpose of fingerprinting the reference and test 
substances, a representative selection of Biokeys are collected 
10 into a panel. If only a single reference ligand is known for 
a receptor, the panel could include one or more representative 
members of each of at least two of the following binding 
classes : 

Change in Binding 

15 Class Binds UL-R (Effect of Ligand) 

A + + 

B + 

C + 0 

D _ 0 

20 E _ + 

Thus classes A, B and C bind unliganded receptor (UL-R) , 
but the ligand increases the binding of A, decreases the 
binding of B, and has no effect on the binding of C. Classes 
D and E do not bind the UL-R. The ligand causes E, but not 
25 to bind the receptor. 

Instead of only two of the above, the panel can include 
representative members of three, four or all five of the 
classes, if Biokeys having the appropriate properties can be 
identified. 

30 The above classes look at binding in only a qualitative 

manner. However, it would be possible to differentiate between 
strong and weak binders of UL-R, and between large and small 
changes in binding as the result of the ligand. If desired, 
one could draw even finer divisions, e.g.; strong vs. moderate 

35 vs. weak, etc. 

If more than one ligand is available, the combinatorial 
possibilities are increased, and, if suitable Biokeys can be 
identified, the panel can be expanded appropriately. 





44 

For example, with two ligands, the following possibilities 
could exist 

Biokey UL-R Licrand A Liqand B 

Z + + + 

5 Y + + 0 

X + + 

W + 0 + 

V 4- 0 0 

U + 0 

10 T + - + 

S + - 0 

R + - 

Q - 0 0 

P - + 0 

15 O - 0 + 

N - + + 

And one could discriminate further, e.g., for Z-1, the 

effect of A is greater than that of B, for Z-2, the reverse, 

and for Z-3, the effects are equal. 
20 Preferably, one, two, three, four, five or more reference 

ligands are used to define the Biokey panel. 

It is not necessary that a particular binding class be 

represented by only a single Biokey. Instead, it may be 

represented by a mixture of two or more Biokeys, and indeed the 
25 mixture may correspond to all of the Biokeys in the Biokey 

library which satisfied the binding criteria for the class in 

question. 

The members of the Biokey panel are chosen with a view to 
maximizing the discriminatory power of the panel. For example, 

3 0 to take an extreme case, if two members of the panel have 
identical binding properties, vis-a-vis, all the available 
reference conformations of the receptor, then one of these 
members is redundant. While including it in the panel does no 
harm, it needlessly increases the costs of the screening. 

35 The similarity of any pair of potential panel members may 

be determined using the similarity measures set forth infra . 
The overall diversity of a given panel may be determined by 
computing all of the pairwise dissimilarities. For a given 
size panel, extracted from a given library, one may seek to 

40 maximize the overall diversity of effect on biological 
activity. Or one may seek to determine, for a set of binding 
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members from a library, what is the size and composition of the 
subject which maximizes the ratio of the overall diversity to 
the number of members . 

The number of panel -based descriptors in the fingerprint 
5 will normally be equal to the number of members in the panel. 
The optimal number of members depends on the number of 
reference substances , and the ability of the panel to 
differentiate them. The larger the number of reference 
substances, and the larger the number of target organisms and 

10 tissues in which the biological activity of the reference 
substance is to be predicted, the larger the panel should be. 
Typically, there will be 2 , 3, 4, 5, 6, 7, 8, 9, or 10 panel 
members. More members may be used, but the cost of the assay 
increases, without necessarily providing a commensurate 

15 increase in the predictive power of the data. 

Reference substances 

Reference substances are known pharmacological agonists 
or antagonists for the receptor in question, and have a known 
20 or ascertainable biological activity in one or more organisms 
and/or tissues . 

Typically, for a given receptor, one, two, three, four, 
five or more reference substances will be fingerprinted. 

"Fincrerprintincr" of Test and Reference Substances 
25 Each test substance will be characterized by a plurality 

of descriptors (the "fingerprint") by which it may be compared 
to reference substances . 

These reference substances may be the particular reference 
ligands used to define the Biokey panel, but are not limited 
30 to those reference ligands. Thus, in example 1, only 
estradiol was used to define the five classes of peptides, but 
the reference substances were estradiol, estriol, tamoxifen, 
nafoxidine and clomiphene. The use of estradiol was not 
critical; the reference substances need not include any of the 
35 reference ligands used to define the BioKey panel. 

The reference substances must be pharmacological agonists 
or antagonists in at least one organism and tissue, while the 
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reference ligands are not so limited. 

For the purpose of the present invention, a plurality of 
descriptors must refer to the effect of the test substance on 
the binding of a member of the Biokey panel to a reference 
5 conformation, e.g., unliganded receptor X, receptor X/ligand 
A, receptor X/ligand B, unliganded receptor Y, receptor 
Y/ligand C, etc. Note that in this context, the term "member" 
may refer to a mixture of Biokeys of the same binding class. 
The descriptor may be qualitative (binds vs. nonbinds; 

10 increases vs. decreases vs. no effect, etc.) or quantitative. 
Preferably, at least 2-10 Biokey-based descriptors are used. 

The test substance may additionally be characterized by 
other descriptors, such as structural descriptors, known in the 
art. Preferably, at least 5-10 different reference substances 

15 are "fingerprinted". 

The reference substances will be characterized in a 
similar manner to the test substances, so that their 
descriptors may be "paired" with the test substance descriptors 
in such a manner that the degree of similarity may be 

20 calculated. 

When fingerprinting a given reference or test substance, 
it may be screened simultaneously against all panel members, 
or individual panel members (or subsets of panel members) may 
be tested separately. Also, all reference substances may be 

25 screened simultaneously against a given receptor/panel member 
combination, or the reference substances may be screened 
individually. The same is true of the screening of the test 
substances. The test substances may be screened after, before 
or simultaneously with the reference substances. 

3 0 Descriptors 

A "descriptor" (also known as a parameter, character, 
variable, or variate) is a numerically expressed characteristic 
of a compound (which may be a protein, or a protein ligand) , 
which helps to distinguish that compound from others. A 

35 descriptor value need not be absolutely specific to a compound 
to be useful. The characteristics may be pure structural 
characteristics (as in a "structural descriptor") or they may 
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refer to the compound's interaction with other compounds. 
"Paired Descriptors" are descriptors of the same property as 
measured in two different molecules, A "descriptor array", 
"list", or "set" is an array, list or set whose elements are 
5 different descriptors for the same molecule. Such an array, 
list or set is referred to herein as a "fingerprint". 

A plurality of paired descriptors for two compounds may 
be used to calculate a similarity between the two compounds. 

Similarity Measures 

10 A similarity measure or coefficient quantifies the 

relationship between two individuals (compounds) , given the 
values of a set of variates (descriptors) common to both. 
Similarity coefficients are usually defined to take values in 
the range of 0 to 1 . 

15 One commonly used measure of similarity is the product 

moment correlation coefficient. Its correlation is unity 
whenever two profiles are parallel, regardless of how far apart 
they are in level. Two profiles may have correlation of +1 
even if they are not parallel, provided that the two sets of 

20 scores are linearly related. 

For binary descriptors, the simplest measure of similarity 
is the simple matching coefficient 

Sij = number of matches 

number of comparisons 

25 The Jaccard or Sneath coefficient modifies the simple 

matching coefficient by ignoring bits which in both i and ± are 
zero, i.e., by ignoring negative matches (mutual absences) . 
In other words, it is obtained by dividing the number of bits 
which are set in both descriptor bit strings, and dividing by 

30 the total number of bits set in either descriptor string. It 
is also called the unweighted Tanimoto coefficient. 

The weighted Tanimoto coefficient for descriptors k and 
individuals i and j. is: 

35 k 

S,,= 

Ew„Xik+Sw3,Xji,-EWkXikXjk 
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k k k 

Gower has defined a general similarity coefficient which 
can be used for binary, qualitative, and quantitative data: 

P P 

5 Sij=ESijk/EWij,^ for individuals ± and j_ and descriptor 

k=l k-1 

W^-jk is set to 1 if the comparison is valid for 
variable k, and to 0 otherwise. If w^^^=0 , then s^^^ is 0. For 
10 binary data, w^jj^ and Sij^ are both 0 if the variable is negative 
in both individuals. The Sij^ is positive only if the binary 
variable is positive for both individuals. For qualitative 
data, Sijj^=l if the individuals are the same for the kth 
3 character, and Sijk=0 if they differ. For quantitative data, 

;^ 15 Sij]^=l- I Xik-X-jk I /Rk where Xi^^ is the value of descriptor k for 

^5 individual i, and is the total range of variable k. 

IJ Descriptors may be quantitative or qualitative. 

12 Quantitative descriptors may be integers or real numbers. 

« Qualitative descriptors divide the data into categories which 

IZ 2 0 may be, but need not be, expressible as having relative 

fy magnitudes. Binary descriptors are a special case of 

^2 qualitative descriptors, in which there are just two 

categories, typically representing the presence or absence of 
a feature. Qualitative data for which the variates have 
25 several levels may be treated like binary data with each level 
of a variate being regarded as a single binary variable (i.e., 
an eight level variate expressed as eight bits) . Or the levels 
may be numbered sequentially (i.e., an eight level variable 
expressed as three bits) . 
30 A set of n-descriptors defines an n-dimensional descriptor 

space; each compound for which a descriptor set is available 
may be said to occupy a point in descriptor space. The 
dissimilarity of two compounds may be expressed as a distance 
between the two points which they occupy in descriptor space. 
35 A distance measure is a similarity measure which is also 

a metric, i.e., satisfies the conditions (i) d(x,y) sO; and 
d(x,y)=0 if x=y; (ii) d (x, y) +d (y , x) ; and (iii) 
d (x, z) +d (y, z) sd (x, y) (the metric or triangular inequality). 
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Of course, the greater the distance, the less the similarity. 

Distances may be calculated on the basis of any of a 
variety of distance measures known in the statistical arts. 

The most commonly used distance measure is the Euclidean 
5 metric : 



k 

It corresponds most closely to our intuitive sense of 
distance . 

10 The absolute, city block, or Manhattan metric is 

dij=^ I Xik-Xjk I 
k 

Its rationale is that if the variables have scale units 
of equal value, the . entities should have the same distance 
15 whether two units apart on each of two variables, or one unit 
apart on one and three on the other. 

The "cosine theta" distance is the cosine of the angle 
between the vector from the origin to point X^j^ and the vector 
from the origin to point Xjj^. 
20 A generalized distance measure is the Minkowski metric: 

di,= (E|X,,-XjJ^)^/- 
k 

which is a Euclidean metric for r=2 and a city block 
metric for r=l . 

25 The Mahalonobis distance measure (D^) is of the form 

dij= (Xi-Xj) ' E-MXi-Xj) 
where E is the pooled-within-groups variance-covariance 
matrix, and X^ and Xj are the vectors of scores for entities i 
and j_. The Mahalanobis distance allows for correlations 
3 0 between variables; if the variables are uncorrelated, is 
equivalent to Euclidean distance measured using standard 
variables . 

The Canberra metric, given below, has the advantage of 
being unaffected by the range of the variable: 

35 d(i,j) = E (|Xj3, - X^^\)/{X,^ + Xj,,)). 

k 

A modified form, which accommodates negative states. 
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is 

d(i,j) = E dXjj, - XiJ/(|X,J + |XjJ)). 
k 

The Calhoun distance uses only rank orders; for molecules 
5 i and j_, the distance is the proportion of the entire set 
(excluding i and j_) that have descriptor states intermediate 
between that for i and that for j_ for one or more of the 
descriptors k. 

A distance measure may be transformed into a similarity 
10 measure by any of a variety of transformations that convert a 
non-negative number to the range 0..1, e.g., 

Sij = l/(l+dij) 

A similarity measure may be converted into a distance by, 
e.g. , dij = 1-Sij . 

15 If there is a theoretical maximum distance (dt-^ax) / based 

on the theoretically possible ranges for each of the component 
descriptors, the similarity may be expressed as 
Sij = l- (dij/dt^^^) 

Alternatively, one may calculate the distances between all 

2 0 pairs, and then use the actual maximum distance (d^n^ax) : 

Sij = l- {d^j/d^m^J 

Instead of using the ratio of the actual distance to the 
actual or theoretical maximum distance, one may express s^j as 
the fraction of the pairs for which the distance is greater 
25 than or equal to d^j . This is a measure of relative 
similarity . 

Descriptors may be weighted (or otherwise transformed) for 
any of several reasons, including: 

(a) to reflect the perceived value of the 

3 0 descriptor for determining whether two proteins 

will be modulated by structurally similar 
drugs ; 

(b) to reflect the perceived reliability of the 
descriptor data; 

35 (c) to correct for differences in scale between 

descriptors, so that a descriptor does not 
dominate a similarity or distance calculation 
merely because its values are of higher 
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magnitude or are spread over a greater range; 
and 

(d) to correct for correlations between 
descriptors . 

5 The raw descriptor values may be, but need not be, 

transformed prior to use in calculating distances. Typical 
transformations are (a) presence (1) /absence (0), (b) ln(x+l), 
(c) frequency in sample, (d) root, and (e) relative range, 
i.e., (value-min) / (max-min) . 

10 The raw descriptor values may be standardized (normalized) 

to have zero mean (x'=x-/Xj,) and/or unit variance (x'=x/a^), 
possibly both (x' = (x-zx,^) /a^) or they be standardized (unitized) 
to fall into the range 0 to 1. 

Descriptor weights may be adjusted empirically on the 

15 basis of specially designed test sets. A training set of 
proteins is identified. Descriptors are evaluated for each 
protein in the set. A training set of compounds, including 
are also tested against each compound in the set. These 
compounds are chosen so that, for any protein in the set, there 

20 is at least one compound which is an agonist or antagonist for 
it. A neural net, with the descriptor weights as inputs, is 
used to predict the activity of each compound against each 
protein, using the calculated protein similarities. For 
example, it will calculate the similarity of protein x to all 

25 other proteins, then treat the activities of the compounds 
against the other proteins as "knowns" and use it to predict 
the activity of the compounds against protein x. This is done 
repeatedly, with each protein taking on the role of protein x, 
in turn. 

30 The coefficient of variation may be useful in comparing 

descriptors; it is the standard deviation divided by the mean. 
If there is no information available about the ultimate 
significance of a descriptor, one may give a greater weight to 
descriptors which have a larger CV and hence a more uniform 

35 distribution. 

It must be emphasized that we do not require use of 
weighted descriptors, let alone of any particular method of 
deriving weights. 
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It is likely that some degree of correlation will exist 
among the descriptors. Standard mathematical methods, such as 
cluster analysis, principal components analysis, or partial 
least squares analysis, may be used to determine which 
5 descriptors are strongly correlated and to replace . them with 
a new descriptor which is a weighted sum of the original 
correlated descriptors. One may alternatively choose (perhaps 
randomly) one of each pair of highly correlated descriptors and 
simply prune it, thereby reducing the amount of data which must 
10 be collected. 

One way of correcting for correlation among the 
descriptors is for each descriptor m, calculate the averacre of 
its squared correlation coefficients with all descriptors n 
^,3 (including m=n, for which the coefficient is necessarily 

l^! 15 unity) , and subtract this number from one to obtain a weight 

,1,3 representing the fraction of the variation in descriptor m 

J=^; which is not explained by the "average" descriptor n. With 

ij. this "average r^" method, if we have four descriptors, and two 

" are perfectly correlated to each other, and the descriptors are 

2 0 otherwise completely uncorrelated, the correlated descriptors 
fy will have weights of 0.5 each, and the other two will have 

weights of 1.0 each. 

The diversity of a set of compounds, as measured by a set 
of descriptors, may be calculated in several ways. 
25 A purely geometric method involves assuming that each 

compound sweeps out a hypersphere in descriptor space, the 
hypersphere having a radius known as the similarity radius. 
The total hypervolume in descriptor space of points within a 
unit similarily radius of one or more of the compounds is 

3 0 calculated. This is compared to the hypervolume achievable if 

none of hypersphere ' s overlap; i.e., to n * volume of a single 
hypersphere, where n is the number of compounds in the set. 
The swept hypervolume may be determined exactly, or by Monte 
Carlo methods. The ratio of the swept hypervolume to the 
35 maximum hypervolume is a measure of compound set diversity, 
ranging from 1 (maximum) to l/n (minimum) . 

Another approach is to calculate all of the pairwise 
distances between compounds in descriptor space. The mean 
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distance is a measure of diversity. If desired, this can be 
scaled by calculating the ratio of the mean distance to the 
maximum theoretical distance. 

A third approach is to apply cluster analysis to the set 
5 of compounds. The method used should be one which does not set 
the number of clusters arbitrarily, but rather decides the 
number based on some goodness-of - f it criterion. The resulting 
number of cluster is a measure of diversity, as is the ratio 
of the number of clusters to the number of compounds. 
10 One may calculate a measure of disorder for a descriptor 

as 

m^ 

H(k) = - E Pj,g m Pj,g 

g=l 

15 where m^ is the number of different states in descriptor 

k, and P^g is the observed proportion of individuals exhibiting 
state g for descriptor k. For uncorrelated descriptors, the 
sum of H(k) for all k is a measure of overall diversity. 
Standard techniques may be used to correct for correlation. 

2 0 Target Receptor 

The target receptor may be a naturally occurring 
substance, or a subunit or domain thereof, from any natural 
source, including a virus, a microorganism (including 
bacterial, fungi, algae, and protozoa), an invertebrate 

25 (including insects and worms) , or the normal or cancerous cells 
of a vertebrate (especially a mammal, bird or fish and, among 
mammals, particularly humans , apes, monkeys, cows, pigs, goats, 
llamas, sheep, rats, mice, rabbits, guinea pigs, cats and 
dogs) . (Usually it is a protein; it may be a nucleic acid. 

30 References to proteins apply, mutatis mutandis , to nucleic 
acids, lipids, carbohydrates and other macromolecules which can 
act as receptors.) Alternatively, the receptor protein may be 
a modified form of a natural receptor. Modifications may be 
introduced to facilitate the labeling or immobilization of the 

35 target receptor, or to alter its biological activity (An 
inhibitor of a mutant receptor may be useful to selectively 
inhibit an undesired activity of the mutant receptor and leave 
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other activities substantially intact) . In the case of a 
protein, modifications include mutation (substitution, 
insertion or deletion of a genetically encoded amino acid) and 
derivatization (including glycosylation, phosphorylation, and 
5 lipidation) . The target may a chimera of two receptors, e.g., 
a mammalian and a yeast receptor, or two receptors of different 
functions, so as to combine the ligand binding function of one 
receptor with the signal transduction function of another. 

A target receptor may be, inter alia , a glyco-, lipo-, 
10 phospho-, or metalloprotein . It may be a nuclear, cytoplasmic, 
membrane, or secreted protein. It may, but need, not, be an 
enzyme . 

The target receptor, instead of being a protein, may be 
a macromolecular nucleic acid, lipid or carbohydrate. If a 

15 nucleic acid, it may be a ribo- or a deoxyribonucleic acid, and 
it may be single or double stranded. It may, but need not, 
have enzymatic activity . 

The target receptor need not be a single macromolecule , 
rather, it may be a complex of a macromolecule with one or more 

20 additional molecules, especially macromolecules . Examples" 
includes ribosomes (RNA: protein complexes) , polysomes 
(mRNA: ribosome complexes) , and chromatin (DNA:protein 
complexes) . For use of polysomes as binding molecules (or as 
display systems), see Kawasaki, USP 5,643,768 and 5,658,754; 

25 Gersuk, et al . , Biochem. Biophys . Res. Comm. 232:578 (1997); 
Mattheakis, et al . , Proc . Nat. Acad. Sci . USA, 91:9022-6 
(1994) . 

The known binding partners (if any) of the target receptor 
may be, inter alia , proteins, oligo- or polypeptides, nucleic 
3 0 acids, carbohydrates, lipids, or small organic or inorganic 
molecules or ions. 

The functional groups of the receptor which participate 
in the ligand-binding interactions together form the ligand 
binding site, or paratope, of the receptor. Similarly, the 
35 functional groups of the ligand which participate in these 
interactions together form the epitope of the ligand. 

In the case of a protein, the binding sites are typically 
relatively small surface patches. The binding characteristics 
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of the protein may often be altered by local modifications at 
these sites, without denaturing the protein. 

While it is possible for a chemical reaction to occur 
between a functional group on a receptor and one on a ligand, 
5 resulting in a covalent bond, receptor protein-ligand binding 
normally occurs as a result of the aggregate effects of several 
noncovalent interactions. Electrostatic interactions include 
salt bridges, hydrogen bonds, and van der Waals forces. 

What is called the hydrophobic interaction is actually the 

10 absence of hydrogen bonding between nonpolar groups and water, 
rather than a favorable interaction between the nonpolar groups 
. themselves. Hydrophobic interactions are important in 

stabilizing the conformation of a receptor protein and thus 
indirectly affect ligand binding, although hydrophobic residues 

15 are usually buried and thus not part of the binding site. 

The receptor may have more than one paratope and they may 
be the same or different. Different paratopes may interact 
with epitopes of different binding partners. An individual 
paratope may be specific to a particular binding partner, or 

20 it may interact with several different binding partners. A 
receptor can bind a particular binding partner through several 
different binding sites. The binding sites may be continuous 
or discontinuous (e.g., vis-a-vis the primary sequence of a 
receptor protein) . 

25 A list of agonists, antagonists, radioligands and 

effectors for many different receptors appears in Appendix 1 
of King, Medicinal Chemistry: Principles and Practice , pp. 290- 
294 (Royal Soc'y Chem. 1994) . Appendix II lists blockers for 
various ion channels (which are another special type of 

3 0 receptor) . Some receptors, and their agonists and/or 

antagonists, are listed in Table A. 

Any nuclear receptor, such as receptors for progestins, 
androgens, glucocorticoids, thyroid hormones, retinoids, 
vitamin D3 and mineralocort icoids could be used in this 

35 fingerprinting system. Affinity selection of peptide libraries 
could be used to identify peptide sequences that bind in the 
presence or absence of agonist as described above. The 
peptides could then be used in the manner described above to 
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classify and characterize modulators of the receptor's 



likely to interact with the progesterone receptor. A system 
for fingerprinting the progesterone receptor may be developed 
5 to test for active components of Premarin. 

As an example of a non-protein receptor, we cite DNA. DNA 
can undergo conformational changes when it is bound for 
example, by a transcription factor or small molecule. For 
example, the antitumor agent cisplatin binds to and alters the 

10 structure of DNA. The altered structure attracts a cellular 
protein containing an HMG box (high mobility group) . The 
protein is believed to sterically block the repair of the 
cisplatin lesion on the DNA and contribute to the effectiveness 
of cisplatin in the treatment of certain types of cancer. 

15 BioKeys could be identified that bind specifically to DNA in 
certain conformations. These Biokeys could be used to identify 
conformational changes that take place in the DNA upon binding 
of a small molecule or protein. 

Target Orcranism 

20 A purpose of the present invention is to predict the 

biological activity in one or more target tissues, as hereafter 
defined, of a target organism. 

The target organism may be a plant, animal, or 
microorganism. The plant or animal may be normal, chimeric or 

25 transgenic. It may or may not be infected with a pathogen 
(e.g., virus) or a parasite. It may be in a normal or an 
abnormal environmental state. It may be of a particular 
developmental stage, size, sex, etc. 



3 0 which case the drug may be intended to increase the disease, 
weather or pest resistance, alter the growth characteristics, 
or otherwise improve the useful characteristics or mute 
undesirable characteristics of the plant. Or it may be a weed, 
in which case the drug may be intended to kill or otherwise 

35 inhibit the growth of the plant, or to alter its. 
characteristics to convert it from a weed to an economic plant. 
The plant may be a tree, shrub, crop, grass, etc. The plant 



activity . 



As described above, components of Premarin are 



In the case of a plant, it may be an economic plant, in 




57 



may be an algae (which are in some cases also microorganisms) , 
or a vascular plant, especially gymnosperms (particularly 
conifers) and angiosperms. Angiosperms may be monocots or 
dicots. The plants of greatest interest are rice, wheat, corn, 
5 alfalfa, soybeans, potatoes, peanuts, tomatoes, melons, apples, 
pears, plums, pineapples, fir, spruce, pine, cedar, and oak. 

If the target organism is a microorganism, it may be 
algae, bacteria, fungi, or a virus (although the biological 
activity of a virus must be determined in a virus- infected 
10 cell) . The microorganism may be human or other animal or plant 
pathogen, or it may be nonpathogenic. It may be a soil or 
water organism, or one which normally lives inside other living 
things . 

If the target organism is an animal, it may be a 

15 vertebrate or a nonvertebrate animal. Nonvertebrate animals 
are chiefly of interest when they act as pathogens or 
parasites, and the drugs are intended to act as a biocidic or 
biostatic agents. Nonvertebrate animals of interest include 
worms, mollusks, and arthropods. 

20 The target organism may also be a vertebrate animal, i.e., 

a mammal, bird, reptile, fish or amphibian. Among mammals, the 
target animal preferably belongs to the order Primata (humans, 
apes and monkeys), Artiodactyla (e.g., cows, pigs, sheep, 
goats, horses), Rodenta (e.g., mice, rats) Lagomorpha (e.g., 

25 rabbits, hares) , or Carnivora (e.g., cats, dogs) . Among birds, 
the target animals are preferably of the orders Anseriformes 
(e.g., ducks, geese, swans) or Galliformes (e.g., quails, 
grouse, pheasants, turkeys and chickens) . Among fish, the 
target animal is preferably of the order Clupeiformes (e.g., 

30 sardines, shad, anchovies, whitefish, salmon). 

Target Tissues 

The term "target tissue" refers to any whole animal, 
physiological system, whole organ, part of organ, miscellaneous 
tissue, cell, or cell component (e.g., the cell membrane) of 
35 a target animal in which the biological activity of a drug may 
be measured. 

Routinely . in mammals one would chose to compare and 
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contrast the biological impact on virtually any and all tissues 
which express the subject receptor protein. The main tissues 
to use are: brain, heart, lung, kidney, liver, pancreas, skin, 
intestines, adrenal glands, breast, prostate, vasculature, 
5 retina, cornea, thyroid gland, parathyroid glands, thymus, bone 
marrow etc. 

Another classification would be by cell type: B cells, 
T cells, macrophages, neutrophils, eosinophils, mast cells, 
platelets, megakaryocytes, erythrocytes, bone marrow stomal 

10 cells, fibroblasts, neurons, astrocytes, neuroglia, microglia, 
epithelial cells (from any organ, e.g. skin, breast, prostate, 
lung, intestines etc), cardiac muscle cells, smooth muscle 
cells, striated muscle cells, osteoblasts, osteocytes, 
chondroblasts , chondrocytes, keratinocytes , melanocytes, etc. 

15 The "target tissues" include those set forth in Table B. 

Of course, in the case of a unicellular organism, there is no 
distinction between the "target organism" and the "target 
tissue " . 

In Vitro vs . In Vivo Assays 

20 The term "in vivo" is descriptive of an event, such as 

binding or enzymatic action, which occurs within a living 
organism. The organism in question may, however, be 

genetically modified. The term "in vitro" refers to an event 
which occurs outside a living organism. Parts of an organism 

25 (e.g., a membrane, or an isolated biochemical) are used, 
together with artificial substrates and/or conditions. For the 
purpose of the present invention, the term in vitro excludes 
events occurring inside or on an intact cell, whether of a 
unicellular or multicellular organism. 

30 In vivo assays include both cell-based assays, and 

organismic assays. The term cell-based assays includes both 
assays on unicellular organisms, and assays on isolated cells 
or cell cultures derived from multicellular organisms. The 
cell cultures may be mixed, provided that they are not 

35 organized into tissues or organs. The term organismic assay 
refers to assays on whole multicellular organisms, and assays 
on isolated organs or tissues of such organisms . . _ ^ . 
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"Biological assays" include both in vivo assays, and in 
vitro assays on subcellular multimolecular components of cells 
such as membranes . 

Preliminarv Screenincr Assays 
5 The invention contemplates at least three occasions for 

preliminary screening : 

(a) screening for potential "BioKeys", using a known 
receptor and one or more known pharmacological modulators 
of the receptor (see General Method step (I)), 
10 (b) screening reference compounds, having a known 

receptor-mediated bioactivity using a known receptor and 
an established BioKey panel , to obtain reference 
fingerprints (see General Method, step (II), and 
(c) screening test compounds for their ability to alter 
15 the binding of a panel of BioKeys to the receptor, 

thereby obtaining a test fingerprint (see General Method, 
step (III) ) . 

The same or different screening methods may be used on 
each occasion . 

20 Preliminary, screening assays will typically be either in 

vitro (cell-free) assays (for binding to an immobilized 
receptor) or cell-based assays (for alterations in the 
phenotype of the cell) . They will not involve screening of 
whole multicellular organisms, or isolated organs. The 

25 comments on biological assays apply mutatis mutandis to 
preliminary screening cell-based assays. 

Pi aqno s t i c As s ays 

While preliminary screening assays are used to determined 

the activity of a compound of uncertain activity, diagnostic 
3 0 assays employ a binding molecule of known binding activity, or 

a conjugate or derivative thereof, as a diagnostic reagent. 

For the purpose of the discussion of diagnostic methods 

and agents which follows, the "binding molecule" may be a 

peptide, peptoid or peptidomimetic of the present invention, 
35 or an oligonucleotide of the present invention, which binds the 

analyte or a binding partner of the analyte . The analyte is 
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a target protein. 

In Vitro Assay Methods and Reagents 

In vitro assays may be diagnostic assays (using a known 
binding molecule to detect or measure an analyte) or screening 
5 assays (determining whether a potential binding molecule in 
fact binds a target) . The format of these two types of assays 
is very similar and, while the description below refers to 
diagnostic assays for analytes, it applies, mutatis mutandis, 
to the screening of molecules for binding to targets. The in 

10 vitro assays of the present invention may be applied to any 
suitable analyte-containing sample, and may be qualitative or 
quantitative in nature. In order to detect the presence, or 
measure the amount, of an analyte, the assay must provide for 
a signal producing system (BPS) in which there is a detectable 

15 difference in the signal produced, depending on whether the 
analyte is present or absent (or, in a quantitative assay, on 
the amount of the analyte) . The detectable signal may be one 
which is visually detectable, or one detectable only with 
instruments. Possible signals include production of colored 

20 or luminescent products, alteration of the characteristics 
(including amplitude or polarization) of absorption or emission 
of radiation by an assay component or product, and 
precipitation or agglutination of a component or product. The 
term "signal" is intended to include the discontinuance of an 

25 existing signal, or a change in the rate of change of an 
observable parameter, rather than a change in its absolute 
value. The signal may be monitored manually or automatically. 

The component of the signal producing system which is most 
intimately associated with the diagnostic reagent is called the 

30 "label". A label may be, e.g., a radioisotope, a fluorophore, 
an enzyme, a co-enzyme, an enzyme substrate, an electron-dense 
compound, or an agglutinable particle. One diagnostic reagent 
is a conjugate, direct or indirect, or covalent or noncovalent, 
of a label with a binding molecule of the invention. 

35 The radioactive isotope can be detected by such means as 

the use of a gamma counter or a scintillation counter or by 
autoradiography. Isotopes which are particularly useful for 
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the purpose of the present invention are ^H, ^^^I, ^^^I, ^^S, ^^C, 
and, preferably, ^^^I . 

It is also possible to label a compound with a fluorescent 
compound. When the f luorescently labeled antibody is exposed 
5 to light of the proper wave length, its presence can then be 
detected due to fluorescence. Among the most commonly used 
fluorescent labelling compounds are fluorescein isothiocyanate , 
rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o- 
phthaldehyde and f luorescamine . 

10 Alternatively, fluorescence-emitting metals such as ^^^Eu, 

or others of the lanthanide series, may be attached to the 
binding protein using such metal chelating groups as 
.diethylenetriaminepentaacetic acid (DTPA) of ethylenediamine- 
tetraacetic acid (EDTA) . 

15 The binding molecules also can be detectably labeled by 

coupling to a chemiluminescent compound. The presence of the 
chemiluminescent compound is then determined by detecting the 
presence of luminescence that arises during the course of a 
chemical reaction after a suitable reactant is pJrovided. 

20 Examples of particularly useful chemiluminescent labeling 
compounds are luminol, isolumino, theromatic acridinium ester, 
imidazole, acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label 
the binding molecule. Bioluminescence is a type of 

25 chemiluminescence found in biological systems in which a 
catalytic protein increases the efficiency of the 
chemiluminescent reaction. The presence of a bioluminescent 
protein is determined by detecting the presence of 
luminescence. Important bioluminescent compounds for purposes 

30 of labeling are luciferin, lucif erase and aequorin. 

Enzyme labels, such as horseradish peroxidase and alkaline 
phosphatase, are preferred. When an enzyme label is used, the 
signal producing system must also include a substrate for the 
enzyme. If the enzymatic reaction product is not itself 

35 detectable, the SPS will include one or more additional 
reactants so that a detectable product appears. 

Assays may be divided into two basic types, heterogeneous 
and homogeneous. In heterogeneous assays, the interaction 
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between the affinity molecule and the analyte does not affect 
the label, hence, to determine the amount or presence of 
analyte, bound label must be separated from free label. In 
homogeneous assays, the interaction does affect the activity 
5 of the label, and therefore analyte levels can be deduced 
without the need for a separation step. 

In general, a target -binding molecule of the present 
invention may be used diagnostically in the same way that a 
target -binding antibody is used. Thus, depending on the assay 

10 format, it may be used to assay the target, or by competitive 
inhibition, other substances which bind the target. The sample 
will normally be a biological fluid, such as blood, urine, 
lymph, semen, milk, or cerebrospinal fluid, or a fraction or 
derivative thereof, or a biological tissue, in the form of, 

15 e.g., a tissue section or homogenate . However, the sample 
conceivably could be (or derived from) a food or beverage, a 
pharmaceutical or diagnostic composition, soil, or surface or 
ground water. If a biological fluid or tissue, it may be taken 
from a human or other mammal, vertebrate or animal, or from a 

20 plant. The preferred sample is blood, or a fraction or 
derivative thereof. 

In one embodiment, the binding molecule is insolubilized 
by coupling it to a macromolecular support, and target in the 
sample is allowed to compete with a known quantity of a labeled 

25 or specifically labelable target analogue. (The conjugate of 
the binding molecule to a macromolecular support is another 
diagnostic agent within the present invention.) The "target 
analogue" is a molecule capable of competing with target for 
binding to the binding molecule, and the term is intended to 

30 include target itself. It may be labeled already, or it may 
be labeled subsequently by specifically binding the label to 
a moiety differentiating the target analogue from authentic 
target. The solid and liquid phases are separated, and the 
labeled target analogue in one phase is quantified. The higher 

35 the level of target analogue in the solid phase, i.e., sticking 
to the binding molecule, the lower the level of target analyte 
in the sample . 

In a "sandwich assay", both an insolubilized target- 
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binding molecule, and a labeled target -binding molecule are 
employed. The target analyte is captured by the insolubilized 
target -binding molecule and is tagged by the labeled target - 
binding molecule, forming a tertiary complex. The reagents may 
5 be added to the sample in either order, or simultaneously. The 
target -binding molecules may be the same or different, and only 
one need be a target -binding molecule according to the present 
invention (the other may be, e.g., an antibody or a specific 
binding fragment thereof) . The amount of labeled target - 

10 binding molecule in the tertiary complex is directly 
proportional to the amount of target analyte in the sample. 

The two embodiments described above are both heterogeneous 
assays. However^ homogeneous assays are conceivable. The key 
is that the label be affected by whether or not the complex is 

15 formed. 

A label may be conjugated, directly or indirectly (e.g., 
through a labeled anti -target-binding molecule antibody) , 
covalently (e.g., with S^PDP) or noncovalent ly , to the target- 
binding molecule, to produce a diagnostic reagent. Similarly, 

20 the target binding molecule may be conjugated to a solid-phase 
support to form a solid phase ("capture") diagnostic reagent. 
Suitable supports include glass, polystyrene, polypropylene, 
polyethylene, dextran, nylon, amylases, natural and modified 
celluloses, polyacrylamides , agaroses, and magnetite. The 

25 nature of the carrier can be either soluble to some extent or 
insoluble for the purposes of the present invention. The 
support material may have virtually any possible structural 
configuration so long as the coupled molecule is capable of 
binding to its target. Thus the support configuration may be 

30 spherical, as in a bead, or cylindrical, as in the inside 
surface of a test tube, or the external surface of a rod. 
Alternatively, the surface may be flat such as a sheet, test 
strip, etc. 

Preferred In Vitro Screening Assays 
35 Scintilla.tion Proximity Assay (SPA): 

An SPA is a homogeneous assay which relies on the short 
penetration range in solution of beta particles from certain 
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isotopes, such as ^H, ^^^I, ^^P and ^^S . 

In a competitive SPA, the scintillant (which emits light 
when a beta particle passes close by) is conjugated to an 
analyte binding molecule. The analyte is allowed to compete 
5 with a short range beta particle-emitting radiolabeled analyte 
analogue for binding to the ABM. If the analyte analogue 
binds, the beta particles emitted by its label come close 
enough to stimulate the scintillant. 

Usually, the scintillant is embedded in beads, or in the 
10 walls of the wells of a microtiter plate. 

In a sandwich SPA, the scintillant- ABM conjugate binds the 
analyte, and a second radiolabeled ABM also binds the analyte, 
thereby forming a ternary complex. 

There are practical reasons for using, instead of a 
15 scintillant -ABM conjugate, a primary simple ABM reagent, and 
a scintillant- (anti-ABM) conjugate acting as a secondary 
reagent which binds the primary reagent . The ABM of the 
primary reagent could then be a mouse monoclonal antibody, and 
the ant i -ABM of the secondary reagent a cheaper polyclonal 
20 anti-mouse antibody, usable in assays for different analytes . 

Fluorescence Pola.rization (FP) : A method for detection of 
ligand binding that results in a change of the rotational 
relaxation time of the fluorescent label reflecting in a change 
in the total molecular mass of the complex containing the 

25 fluorescent ligand. A measurement is taken by excitation of 
the fluorescent moiety on the ligand by light of the proper 
wavelength that has passed through a polarizing filter and 
performing two measurements on the emitted light. The first 
measurement is performed by passing the light through a 

30 polarizing filter that is parallel to the polarization of the 
excitation polarizer.' The second measurement is performed by 
passing the light through a polarizing filter that is 
perpendicular to the polarization of the excitation polarizer. 
The intensities of the emitted light from the parallel and 

35 perpendicular measurements are used to determine the 
polarization of the fluorescent ligand by the following 

equation mP = [(Iparallel - ^perpendicular) / ( Iparallel + 
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^perpendicular) xlOOO] . Atl increase in mP indicates that more 
polarized light is being emitted and corresponds to the 
formation of a complex. 



Fluorescence Resonance Energy Transfer (FRET) : A method for 
5 detection of complex formation, such as ligand-receptor 
binding, that relies upon the through- space interactions 
between two fluorescent groups. A fluorescent molecule has a 
specific wavelength for excitation and another wavelength for 
emission. Pairs of fluorophores are selected that have an 

10 overlapping emission and excitation wavelength. Paired 
fluorophores are detected by a through- space interaction 
referred to as resonance energy transfer. When a donor 
fluorophore is excited by light, it would normally emit light 
at a higher wavelength; however, during FRET energy is 

15 transferred from the donor to the acceptor fluorophore allowing 
the excited donor to relax to the ground-state without emission 
of a photon. The acceptor fluorophore becomes excited and 
release energy by emitting light at its emission wavelength. 
This means that when a donor and an acceptor fluorophore are 

20 held in close proximity (<100 Angstroms) , such as when one 
fluorophore is attached to a ligand and one is attached to a 
receptor and the ligand binds to the receptor, excitation of 
the donor is coupled with emission from the acceptor. 
Conversely, if no complex is formed the excitation of the donor 

2 5 results in no emission from the acceptor. A common 

modification of this technique, sometimes referred to as 
fluorescence quenching, is accomplished using an acceptor group 
that is not fluorescent but efficiently accepts the energy from 
the donor fluorophore. In this case, when a complex is formed 

3 0 the excitation of the donor fluorophore is not accompanied by 

light emission at any wavelength. When this complex is 
dissociated the excitation of the donor results in emission of 
light at the wavelength of the donor. 

Time -Resol ved Fl uorescence 
35 The basic fluorescence assays can be modified to increase 
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the signal to noise ratio. If there is a difference in the 
temporal behavior of signal fluorescence and background 
fluoresence, then ^'time-resolved fluorescence" may be used to 
better distinguish the two. 
5 One may measure the decay of the total fluorescence 

intensity, or the decay of the polarization anisotropy. 

In a time-resolved form of a FRET assay. Europium 
cryptate (EuK) serves as the donor fluorophore. The cryptate 
protects the europium ion from fluorescence quenching. The 

10 acceptor fluorophore is XL665, a modified allopycocyanine . The 
efficiency of FRET is 50% at a distance of 9 nm in serum, and 
the emission is at 665 nm. The XL665 emission is measured 
after a 50 microsec time delay (hence the name) which 
eliminates background (e.g., from free XL665 not stimulated by 

15 EuK) . This is possible because the XL665 emission is 
relatively long-lived . 

Fluorescence assays may be used in both cell-free and 
cell-based formats. Of course, for cell-based assays, the 
fluorophore labeled probes must be introduced into the cells 

20 in question. 

For more information on fluorescence assays, see Szollosi, 
et al.. Comm. Clin. Cytometry, 34:159-179 (1998); Millar, Curr . 
Op. Struct. Biol., 6:637-42 (1996); Mitra, et al . , Gene, 
173:13-17 (1996), Alfano, et al . , Ann. N.Y. Acad. Sci., 838:14- 

25 28 (1998); Lundblad, et al . , Mol . Endocrinol., 10:607-12 
(1996); Gonzalez and Negulescu , Curr. Op. Biotechnology, 9:624- 
31 (1998). For bioluminescence assays, see Stables, et al . , 
Anal. Biochem., 252:115-126 (1997). 

Biolocrical Assays 
30 While a major purpose of the invention is to minimize the 

need for biological assays, they cannot be altogether avoided. 

In order to predict the biological activity of a substance, one 

must know the biological activities of a reasonable number of 

reference substances . 
35 A biological assay measures or detects a biological 

response . of a biological entity to a substance. The present 
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invention is concerned with responses which are, at least in 
part, mediated by a receptor. 

The biological entity may be a whole organism, an isolated 
organ or tissue, freshly isolated cells, an immortalized cell 
5 line, or a subcellular component (such as a membrane; this term 
should not be construed as including an isolated receptor) . 
The entity may be, or may be derived from, an organism which 
occurs in nature, or which is modified in some way. 
Modifications may be genetic (including radiation and chemical 

10 mutants, and genetic engineering) or somatic (e.g., surgical, 
chemical, etc.). In the case of a multicellular entity, the 
modifications may affect some or all cells. The entity need 
not be the target organism, or a derivative thereof, if there 
is a reasonable correlation between bioassay activity in the 

15 assay entity and biological activity in the target organism. 

The entity is placed in a particular environment, which 
may be more or less natural. For example, a culture medium 
may, but need not, contain serum or serum substitutes, and it 
may, but need not, include a support matrix of some kind, it 

20 may be still, or agitated. It may contain particular 
biological or chemical agents, or have particular physical 
parameters (e.g., temperature), that are intended to nourish 
or challenge the biological entity. 

There must also be a detectable biological marker for the 

25 response. At the cellular level, the most common markers are 
cell survival and proliferation, cell behavior (clustering, 
motility) , cell morphology (shape, color) , and biochemical 
activity (overall DNA synthesis, overall protein synthesis, and 
specific metabolic activities, such as utilization of 

30 particular nutrients, e.g., consumption of oxygen, production 
of CO2, production of organic acids, uptake or discharge of 
ions) . 

The direct signal produced by the biological marker may 
be transformed by a signal producing system into a different 
3 5 signal which is more observable, for example, a fluorescent or 
colorimetric signal . 

The entity, environment, marker and signal producing 
system are chosen to achieve a clinically acceptable level of 
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. sensitivity, specificity and accuracy. 

Reference substances should be tested in the appropriate 
assays relevant to the tissue distribution of the targeted 
receptor. For instance, for the estrogen receptor which is 
5 expressed in breast epithelium, liver mesenchymal cells, 
osteoclasts and uterine epithelium (among others) appropriate 
assays would include, among others, breast and uterine 
epithelial cell proliferation, osteoclast apoptosis, and 
hepatocyte production of lipids such as triglycerides and 

10 cholesterol and lipoproteins such as high density lipoproteins 
and low density lipoproteins. 

If one were to utilize the androgen receptor which is 
expressed in, among others, prostate epithelium, hepatocytes, 
striated muscle cells, then one would might chose to carry out 

15 assays of the reference substance set for, among others, 
prostate hypertrophy, hyperplasia or prostate epithelial cell 
proliferation, muscle cell hyperplasia or hypertrophy and 
heptotoxicity etc . 

As another example, if one were to utilize the beta-2- 

20 adrenergic receptor, which is expressed in, among others, the 
heart, brain and peripheral vasculature, then one may chose to 
test reference substances in cardiac function assays (such as 
cardiac rate and eletrocardiographic changes) , assays for their 
impact on blood pressure and assays to evaluate their impact: 

25 on neuronal activity within the central nervous system. 

Cell-Based Assays 

In a preferred cell-based assay, the receptor is 
functionally connected to a signal (biological marker) 
producing system, which may be endogenous or exogenous to the 
30 cell. 

" Zero-Hybrid" Systems 

In these systems, the binding of a peptide to the target 
protein results in a screenable or selectable phenotypic 
change, without resort to fusing the target protein (or a 
35 ligand binding moiety thereof) to an endogenous protein. It 
may be that the target protein is endogenous to the host cell. 
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or is substantially identical to an endogenous receptor so that 
it can take advantage of the latter' s native signal 
transduction pathway. Or sufficient elements of the signal 
transduction pathway normally associated with the target 
5 protein may be engineered into the cell so that the cell 
signals binding to the target protein. 

"One -Hybrid" Systems 

In these systems, a chimera receptor, a hybrid of the 
target, protein and an endogenous receptor, is used. The 
10 chimeric receptor has the ligand binding characteristics of the 
target protein and the signal transduction characteristics of 
the endogenous receptor. Thus, the normal signal transduction 
pathway of the endogenous receptor is subverted. 

Preferably, the endogenous receptor is inactivated, or the 
15 conditions of the assay avoid activation of the endogenous 
receptor, to improve the signal-to-noise ratio. 

See Fowlkes USP 5,789,184 for a yeast system. 
Another type of "one-hybrid" system combines a peptide: 
DNA-binding domain fusion with an unfused target receptor that 
20 possesses an activation domain. 

"Two -Hybrid" System 

In a preferred embodiment, the cell -based assay is a two 
hybrid system. This term implies that the ligand is 
incorporated into a first hybrid protein, and the receptor into 

25 a second bybrid protein. The first hybrid also comprises 
component A of a signal generating system, and the second 
hybrid comprises component B of that system. Components A and 
B, by themselves, are insufficient to generate a signal. 
However, if the ligand binds the receptor, components A and B 

30 are brought into sufficiently close proximity so that chey can 
cooperate to generate a signal. 

Components A and B may naturally occur, or be 
substantially identical to moieties which naturally occur, as 
components of a single naturally occurring biomolecule, or they 

35 may naturally occur, or be substantially identical to moieties 
which naturally occur, as separate naturally occurring 
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biomolecules which interact in nature. 

Two-Hybrid System: Transcription Factor Type 

In a preferred "two-hybrid" embodiment, one member of a 
peptide ligand : receptor binding pair is expressed as a fusion 
5 to a DNA-binding domain (DBD) from a transcription factor (this 
fusion protein is called the "bait"), and the other is 
expressed as a fusion to a transactivation domain (TAD) (this 
fusion protein is called the "fish", the "prey", or the 
"catch") . The transactivation domain should be complementary 
10 to the DNA-binding domain, i.e., it should interact with the 
latter so as to activate transcription of a specially designed 
reporter gene that carries a binding site for the DNA-binding 
domain. Naturally, the two fusion proteins must likewise be 
complementary . 

15 This complementarity may be achieved by use of the 

complementary and separable DNA-binding and transcriptional 
activator domains of a single transcriptional activator 
protein, or one may use complementary domains derived from 
different proteins. The domains may be identical to the native 

20 domains, or mutants thereof. The assay members may be fused 
directly to the DBD or TAD, or fused through an intermediated 
linker. 

The target DNA operator may be the native operator 
sequence, or a mutant operator. Mutations in the operator may 

25 be coordinated with mutations in the DBD and the TAD. An 
example of a suitable transcription activation system is one 
comprising the DNA-binding domain from the bacterial repressor 
LexA and the activation domain from the yeast transcription 
factor Gal4, with the reporter gene operably linked to the LexA 

3 0 operator. 

It is not necessary to employ the intact target receptor; 
just the ligand-binding moiety is sufficient. 

The two fusion proteins may be expressed from the same or 
different vectors. Likewise, the activatable reporter gene may 
35 be expressed from the same vector as either fusion protein (or 
both proteins) , or from a third vector. 

Potential - DNA-binding domains include Gal4 , LexA, and. 
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mutant domains substantially identical to the above. 

Potential activation domains include E. coli B42, Gal4 
activation domain II, and HSV VP16, and mutant domains 
substantially identical to the above. 
5 Potential operators include the native operators for the 

desired activation domain, and mutant domains substantially 
identical to the native operator. 

The fusion proteins may comprise nuclear localization 
signals . 

10 The assay system will include a signal producing system, 

too. The first element of this system is a reporter gene 
operably linked to an operator responsive to the DBD and TAD 
of choice. The expression of this reporter gene will result, 
directly or indirectly, in a selectable or screenable phenotype 

15 (the signal) . The signal producing system may include, besides 
the reporter gene, additional genetic or biochemical elements 
which cooperate in the production of the signal. Such an 
element could be, for example, a selective agent in the cell 
growth medium. There may be more than one signal producing 

20 system, and the system may include more than one reporter gene. 

The sensitivity of the system may be adjusted by, e.g., 
use of competitive inhibitors of any step in the activation or 
signal production process, increasing or decreasing the number 
of operators, using a stronger or weaker DBD or TAD, etc. 

25 When the signal is the death or survival of the cell in 

question, or proliferation or nonprolif eration of the cell in 
question, the assay is said to be a selection. When the signal 
merely results in a detectable phenotype by which the 
signalling cell may be differentiated from the same cell in a 

30 nonsignalling state (either way being a living cell) , the assay 
is a screen. However, the term "screening assay" may be used 
in a broader sense to include a selection. When the narrower 
sense is intended, we will use the term "nonselective screen" . 
Various screening and selection systems are discussed in 

35 Ladner, USP 5,198,346. 

Screening and selection may be for or against the peptide: 
target protein or compound : target protein interaction. 

Preferred assay cells are microbial (bacterial, yeast. 
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algal, protozooal) , invertebrate (esp. mammalian, particularly 
human) . The best developed two-hybrid assays are yeast and 
mammalian systems . 

Normally, two hybrid assays are used to determined whether 
5 a protein X and a protein Y interact, by virtue of their 
ability to reconstitute the interaction of the DBD and the TAD. 
However, augmented two-hybrid assays have been used to detect 
interactions that depend on a third, non-protein ligand. 

For more guidance on two-hybrid assays, see Brent and 

10 Finley, Jr., Ann. Rev. Genet., 31:663-704 (1997); Fremont- 
Racine, et al.. Nature Genetics, 277-281 (16 July 1997) ; Allen, 
et al., TIBS, 511-16 (Dec. 1995); LeCrenier, et al . , BioEssays, 
20:1-6 (1998); Xu, et al . , Proc . Nat. Acad. sci. (USA), 
94:12473-8 (Nov. 1992); Esotak, et al . , Mol . Cell. Biol., 

15 15:5820-9 (1995); Yang, et al . , Nucleic Acids Res., 23:1152-6 
(1995); Bendixen, et al . , Nucleic Acids Res . , 22:1778-9 (1994); 
Fuller, et al . , BioTechniques , 25:85-92 (July 1998); Cohen, et 
al.,-PNAS (USA) 95:14272-7 (1998); Kolonin and Finley, Jr.-, 
PNAS (USA) 95:14266-71 (1998). See also Vasavada, et al . , PNAS 

20 (USA), 88:10686-90 (1991) (contingent replication assay), and 
Rehrauer, et al . , J. Biol. Chem., 271:23865-73 91996) (LexA 
repressor cleavage assay) . 

Two-Hybrid Systems: Reporter Enzyme type 

In another embodiment, the components A and B reconstitute 
25 an enzyme which is not a transcription factor. It may, for 
example, be DHFR, or one of the other enzymes identified in 
WO98/34120 . 

As in the last example, the effect of the reconstitution 
of the enzyme is a phenotypic change which may be a screenable 

3 0 change, a selectable change, or both. 

Universite de Montreal, WO98/34120 describes the use of 
protein- fragment complementation assays to detect biomolecular 
interactions in vivo and in vitro. Fusion peptides respectively 
comprising N and C terminal fragments of murine DHFR were 

35 fused to GCN4 leucine zipper sequences and co-expressed in 
bacterial cells whose endogenous DHFR activity was inhibited. 
DHFR is composed of three structural fragments forming two 
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domains; the discontinuous 1-4 6 and 106-18 6 fragments form one 
domain and the 47-105 fragment forms the other. WO98/34120 
cleaved DHFR at residue 107. GCN4 is a homodimerizing protein. 
The homodimerization of GCN4 causes reassociation of the two 
5 DHFR domains and hence reconstitution of DHFR activity. 

WO98/34120 suggest that fragments of other enzyme reporter 
molecules could be used in place of DHFR. 

See also, Pelletier, et al., Proc. Nat. Acad. Sci . USA, 
95: 12141-6 (1998) (same system); 

10 Karimova et al . , Proc. Nat. Acad. Sci. USA 95:5752-6 

(1998) discloses a bacterial two-hybrid system, in which the 
catalytic domain of Bordetella pertussis adenylate cyclase 
reconstituted as a result of interaction of two proteins, 
leading to cAMP synthesis) . 

15 In a similar system, designed to distinguish 

heterodimerization as distinct from homodimerization, one test 
protein was fused to native LexA and the other to a mutant of 
LexA with altered DNA specificity. Normally, LexA dimerizes 
to bind its target operator. Because of the mutation, and the 

20 use of a hybrid operator, only a heterodimer could achieve DNA 
binding. See Dmitrova, et al , , Mol . Gen. Genet., 57: 205-212 
(1998) . 

Stanford U. , WO98/44350 describes a reporter subunit 
complementation assay which employs fusion proteins each 

25 compromising one of a pair of weakly complementing, singly 
inactive, beta galactosidase mutants, which complement each 
other to produce an active beta galactosidase. See also Rossi, 
et al., Proc. Nat. Acad. Sci. USA, 94:8405-10 (1997); Mohler 
and Blau, Proc. Nat. Acad. Sci. USA, 93: 12423-7 (1996). 

30 Cornell U. , W098/34948 describes a strategy for the 

identification of small peptides that activate or inactivate 
a G protein coupled receptor. The peptides of a combinatorial 
peptide library are tethered to a GPCR of interest in a cell, 
and the cell is monitored to determine whether the peptide is 

35 an agonist or an antagonist. The peptide is tethered to the 
GPCR by replacing the N-terminal of the GPCR with the N- 
terminus of a self -activating receptor, and replacing the 
natural peptide ligand. present therein with the library 
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peptide. An example of a self -activating receptor would be the 
thrombin receptor. 

Sadee, USP 5,882,944 discloses a cell-based assay for the 
effect of test compounds on ml receptors in which the cells are 
5 incubated with an ml agonist to const itutively activate them, 
the agonist is removed, the baseline activity of the receptor 
is determined, the cells are exposed to the test compound, and 
the receptor activity is compared to the baseline level. The 
activity measured may be directed to cAMP, GTPase, or GTP 
10 exchange, 

Martin, et al . , J. Biol. Chem. , 271: 361-6 (1996) 
describes the screening of a combinatorial peptide-on-plasmid 
library based on the C terminus of the alpha subunit of Gsubt 
(340-350) for peptides which bind rhodopsin. In the library, 

15 the library peptides are fused to the C terminus of the DNA 
binding protein lad, which binds to lacO DNA sequences on the 
vector expressing the peptide. In the random DNA, the base mix 
was chosen so as to yield roughly a 50% chance that a given 
codon would be mutated to yield a different amino acid. 

20 Stables, et al . , Anal. Biochem. , 252: 115-126 (1997) 

describes a cell-based bioluminescent assay for GPCR agonist 
activity. The GPCR is co-expressed with apoaequorin, a 
calcium-sensitive photoprotein . Agonist binding to a receptor 
which activates certain G-alpha subunits, such as G-alphal6, 

25 results in an increase in intracellular calcium concentration 
and subsequent bioluminescence . 
Relative Affinity 

Where this specification indicates that a molecule B binds - 
a target Tl substantially more strongly than a target T2 , or 

3 0 that a molecule Bl binds a target T substantially more strongly 
than an alternative molecule B2 binds the same target T, it 
means that the difference in binding is detectable and is 
manifest to a useful degree in the relevant context, e.g., 
screening, diagnosis, purification, or therapy. 

35 Generally speaking, a tenfold difference in binding will 

be considered substantial, however this is not necessarily 
required . 
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" Substantially Identical " 

A mutant protein (peptide) is substantially identical to 
a reference protein (peptide) if (a) it has at least 10% of a 
specific binding activity or a non-nutritional biological 
5 activity of the reference protein, and (b) is at least 50% 
identical in amino acid sequence to the reference protein 
(peptide) . 

Percentage amino acid identity is determined by aligning 
the mutant and reference sequences according to a rigorous 

10 dynamic programming algorithm which globally aligns their 
sequences to maximize their similarity, the similarity being 
scored as the sum of scores for each aligned pair according to 
an unbiased PAJy[250 matrix, and a penalty for each internal gap 
of -12 for the first null of the gap and -4 for each additional 

15 null of the same gap. The percentage identity is the number 
of matches expressed as a percentage of the adjusted (i.e., 
counting inserted nulls) length of the reference sequence. 

A mutant DNA sequence is substantially identical to a 
reference DNA sequence if they are structural sequences, and 

20 encoding mutant and reference proteins which are substantially 
identical as described above. 

If instead they are regulatory sequences, they are 
substantially identical if the mutant sequence has at least 10% 
of the regulatory activity of the reference sequence, and is 

25 at least 50% identical in nucleotide sequence to the reference 
sequence. Percentage identity is determined as for proteins 
except that matches are scored +5, mismatches -4, the gap open 
penalty is -12, and the gap extension penalty (per additional 
null) is -4 . 

30 Preferably, sequence which are substantially identical 

exceed the minimum identity of 50% e.g., are 51%, 66%, 75%, 
80%, 85%, 90%, 95% or 99% identical in sequence. 

DNA sequences may also be considered "substantially 
identical" if they hybridize to each other under stringent 

35 conditions, i.e., conditions at which the Tm of the 
heteroduplex of the one strand of the mutant DNA and the more 
complementary strand of the reference DNA is not in excess of 
10 °C. less than -.-the Tm of the reference DNA homoduplex. 
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Typically this will correspond to a percentage identity of 85- 
90% . 

" Conservative Modifications " 

"Conservative modifications" are defined as 
5 (a) conservative substitutions of amino acids as 

hereafter defined; or 

(b) single or multiple insertions (extension) or 
deletions (truncation) of amino acids at the 
termini . 

10 "Semi -Conservative Modifications" are modifications which 

are not conservative, but which are (a) semi -conservative 
substitutions as hereafter defined; or (b) single or multiple 
insertions or deletions internally, but at interdomain 
boundaries, in loops or in other segments of relatively high 

15 mobility. 

The term "conservative" is used here in an a priori sense, 
i.e., modifications which would be expected to preserve 3D 
structure and activity, based on analysis of the naturally 
occurring families of homologous proteins and of past 
20 experience with the effects of deliberate mutagenesis, rather 
than post facto , a modification already known to conserve 
activity. Of course, a modification which is conservative a 
priori may, and usually is, also conservative post facto . 

Preferably, except at the termini, no more than about five 
25 amino acids are inserted or deleted at a particular locus, and 
the modifications are outside regions known to contain binding 
sites important to activity. 

Preferably, insertions or deletions are limited to the 
termini . 

3 0 A conservative substitution is a substitution of one amino 

acid for another of the same exchange group, the exchange 
groups being defined as follows 

I Gly, Pro, Ser, Ala (Cys) (and any nonbiogenic, 
neutral amino acid with a hydrophobicity not 

35 exceeding that of the aforementioned a.a.'s) 

II Arg, Lys, His (and any nonbiogenic, positively- 
charged amino acids) 
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III Asp, Glu, Asn, Gin (and any nonbiogenic negatively- 
charged amino acids) 

IV Leu, lie. Met, Val (Cys) (and any nonbiogenic, 
aliphatic, neutral amino acid with a hydrophobic ity 

5 too high for I above) 

V Phe, Trp, Tyr (and any nonbiogenic, aromatic neutral 
amino acid with a hydrophobicity too high for I 
above) . 

Note that Cys belongs to both I and IV. 

10 Residues Pro, Gly and Cys have special conformational 

roles. Cys participates in formation of disulfide bonds. Gly 
imparts flexibility to the chain. Pro imparts rigidity to the 
chain and disrupts o; helices. These residues may be essential 
in certain regions of the polypeptide, but subst itutable 

15 elsewhere. 

One, two or three conservative substitutions are more 
likely to be tolerated than a larger number. 

"Semi -conservative substitutions" are defined herein as 
being substitutions within supergroup I/II/III or within 
20 supergroup IV/V, but not within a single one of groups I-V. 
They also include replacement of any other amino acid with 
alanine. If a substitution is not conservative, it preferably 
is semi-conservative . 

"Non-conservative substitutions" are substitutions which 
25 are not conservative. They include " semi -conservative 

substitutions" as a subset. 

"Highly conservative substitutions" are a subset of 
conservative substitutions, and are exchanges of amino acids 
within the groups Phe/Tyr/Trp, Met/Leu/Ile/Val , His/Arg/Lys, 
3 0 Asp/Glu and Ser/Thr/Ala. They are more likely to be tolerated 
than other conservative substitutions. Again, the smaller the 
number of substitutions, the more likely they are to be 
tolerated . 

A protein (peptide) is conservatively identical to a 
35 reference protein (peptide) it differs from the latter, if at 
all, solely by conservative modifications, the protein (peptide 
remaining at least seven amino acids long if the reference 
^.protein (peptide) was at least seven amino acids long. 
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A protein is at least semi-conservatively identical to a 
reference protein (peptide) if it differs from the latter, if 
at all, solely by semi -conservative or conservative 
modifications . 

5 A protein (peptide) is nearly conservatively identical to 

a reference protein (peptide) if it differs from the latter, 
if at all, solely by one or more conservative modifications 
and/or a single nonconservative substitution. 

It is highly conservatively identical if it differs, if 

10 at all, solely by highly conservative substitutions. 

The core sequence o^f a reference protein (peptide) is the 
largest single fragment which retains at least 10% of a 
particular specific binding activity, if one is specified, or 
otherwise of at least one specific binding activity of the 

15 referent. If the referent has more than one specific binding 
activity, it may have more than one core sequence, and these 
may overlap or not . 

If it is taught that a peptide of the present invention 
may have a particular similarity relationship (e.g., markedly 

20 identical) to a reference protein (peptide) , preferred peptides 
are those which comprise a sequence having that relationship 
to a core sequence of the reference protein (peptide) , but with 
internal insertions or deletions in either sequence excluded. 
Even more preferred peptides are those whose entire sequence 

25 has that relationship, with the same exclusion, to a core 
sequence of that reference protein (peptide) . 

The Biokeys of the present invention include not only the 
listed (reference) peptides, but also other peptides which are 
markedly identical. Preferably, the degree of identity 

30 (similarity) is higher than merely markedly identical. 

Where this specification sets forth a consensus sequence 
for a particular class of peptides then any peptide comprising 
said consensus is a preferred peptide according to this 
invention . 

3 5 "Non-Naturally Occurring" 

Reference to a peptide or protein as "non-naturally 
occurring" means that it does not occur, as a unitary molecule. 
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in non-genetically engineered cells or viruses. It may be 
biologically produced in genetically engineered cells, or 
genetically engineered virus-transf ected cells, and it may be 
a segment of a larger, naturally occurring protein. 
5 If it is disclosed that a peptide preferably is not 

naturally occurring, it more preferably is not conservatively 
identical to any naturally occurring peptide. 



Design of Chimeric Proteins 

A chimeric protein is a hybrid of two or more different 

10 proteins. The proteins are usually related, e.g., a 

statistically significant (at least 6 sigma) alignment when 
aligned as described above, and compared to the similar 
alignment of jumbled sequences. More often are "substantially 
identical" as defined above. 

15 Functional chimeras may be identified by a systematic 

synthesize-and-test strategy. It is not necessary that all 
theoretically conceivable chimeras be evaluated directly. 

One strategy is described schematically below. We divide 
the aligned protein sequences into two or more testable units. 

2 0 These units may be equal or unequal in length. Preferably, the 

units correspond to functional domains or are demarcated so as 
to correspond to special features of the sequence, e.g., 
regions of unusually high divergence or similarity, conserved 
or unconserved regions in the relevant protein family or the 
25 presence of a sequence motif, or an area of unusual 
hydrophilicity or hydrophobicity . Let "A" represent a unit of 
protein A, and "B" a corresponding unit of protein B. If there 
are five units (the choice of five instead of two, three, four, 
six, ten, etc. is arbitrary), we can synthesize and test any 

3 0 or all of the following chimeras, which 

will help us rapidly localize the critical regions: 

(a) progressive C-terminal substitution of B sequence for 
A sequence, e.g., 

A A A A A 



35 



A A A A B 
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A A A B B 
A A B B B 
A B B B B 
B B B B B 

5 (b) progressive N-terminal substitution of B sequence for 

A sequence 

A A A A A 
B A A A A 
B B A A A 

10 B B B A A 

B B B B A 
B B B B B 

(c) dual terminal substitutions, e.g., 
B B B B B 

15 A B B B A 

A A B A A 



A A A A A 



and 



A A A A A 

20 B A A A B 

B B A B B 
B B B B B, 

and 

(d) single replacement "scans," such as 
25 B A A A A 

A B A A A 
A A B A A 
A A A B A 
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A A A A B 

and 

A B B B B 
B A B B B 
5 B B A B B 

B B B A B 
B B B B A 

Based on the data these tests provide, it may appear that, 
e.g., the key difference between the A and B sequences vis-a- 
10 vis a property of interest, is in the fifth unit. One can then 
subdivide that unit into subunits and test further, e.g. 

B B B B (bb) 

B B B B (ba) 

B B B B (ab) 

15 B B B B (aa) 

where the parenthesis refer to two subunits into which the 
fifth unit was subdivided. 

Design of Functional Mutants, Generally 

A protein is more likely to tolerate a mutation which 

20 (a) is a substitution rather than an insertion or 

deletion; 

(b) is an insertion or deletion at the terminus, 
rather than internally, or, if internal, is at a 
domain boundary, or a loop or turn, rather than in 

25 an alpha helix or beta strand; 

(c) affects a surface residue rather than an 
interior residue; 

(d) affects a part of the molecule distal to the 
binding site; 

30 (e) is a substitution of one amino acid for another 

of similar size, charge, and/or hydrophobicity , and 
does not destroy a disulfide bond or other 
crosslink; and 
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(f) is at a site which is subject to substantial 
variation among a family of homologous proteins to 
which the protein of interest belongs. 
These considerations can be used to design functional mutants. 

5 Surface vs. Interior Residues 

Charged residues almost always lie on the surface of the 
protein. For uncharged residues, there is less certainty, but 
in general, hydrophilic residues are partitioned to the surface 
and hydrophobic residues to the interior. Of course, for a 
10 membrane protein, the membrane- spanning segments are likely to 
be rich in hydrophobic residues. 

Surface residues may be identified experimentally by 
various labeling techniques, or by 3-D structure mapping 
techniques like X-ray diffraction and NMR. A 3-D model of a 
15 homologous protein can be helpful. 

Binding Site Residues 

Residues forming the binding site may be identified by (1) 
comparing the effects of labeling the surface residues before 
and after complexing the protein to its target, (2) labeling 

20 the binding site directly with affinity ligands, (3) 
fragmenting the protein and testing the fragments for binding 
activity, and (4) systematic mutagenesis (e.g., alanine- 
scanning mutagenesis) to determine which mutants destroy 
binding. If the binding site of a homologous protein is known, 

25 the binding site may be postulated by analogy. 

Protein libraries may be constructed and screened that a 
large family (e.g., 10®) of related mutants may be evaluated 
simultaneously . 

G Proteins 

30 This section is condensed from Fowlkes, USP 5,789,184, 

incorporated by reference in its entirety. 



Mammalian Q Protein Alpha Subunits 
Stimulatory (Gas) Subunits 

Through reconst itut ion analysis of the cyc~ mutant of S4 9 
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murine lymphoma cells, the Gofs protein was identified (Ross and 
Oilman (1977) J. Biol. Chem. 252, 6966-6969) as a stimulatory 
guanine nucleotide-binding protein that coupled hormone 
receptors to adenylyl cyclase. Mammalian Gas cDNA clones have 
5 been obtained from human brain (Bray et al . 1986, 1987), human 
liver (Mattera et al . 1986) , bovine brain (Harris et al . 1985) , 
bovine adrenal gland (Robishaw et al , 1986), bovine cerebral 
cortex (Nukada et al . 1986), hamster lung fibroblasts (Mercken 
et al. 1990), rat glioma cells (Itoh et al . 1986, 1988), rat 

10 olfactory neuroepithelium (Jones and Reed 1987) , mouse 
macrophages (Sullivan et al . 1986), and mouse lymphoma cells 
(Sullivan et al . 1987; Rail and Harris 1987). Bray et al . 
(1986) isolated four different Gofs cDNAs from human brain 
(Gq'sI-4); these forms appear to arise from a single Gofs gene 

15 by alternate splicing. The Gofs gene contains 13 exons (Kozasa 
et al . 1988) which are all present in the long form of Gees. 
A short form of the molecule lacks the 15 amino acids encoded 
by exon 3. In addition, two alternate mRNAs arise that differ 
in the presence or absence of a serine codon at the start" of 

20 exon 4 when different splice sites are used at the 5' end of 
that exon. 

Inhibitory (Gofi) Subunits 

The B. pertussis toxin was found to (1) abolish the 
hormonal inhibition of adenylyl cyclase and (2) to ADP- 

25 ribosylate a 41-kd membrane protein. Purification of this 
toxin substrate permitted its identification as a guanine 
nucleotide-binding protein related to the mammalian G proteins 
Gas and transducin. The protein was denoted Ga± (i= inhibitory 
for adenylyl cyclase) . 

3 0 Three single copy genes encode G protein subunits of the 

Gofi type and the predicted proteins (Gofi-l, Gckfi-2 and GQ'i-3) 
share 85% sequence identity. In coupling to adenylyl cyclase 
to signal inhibition of this enzyme, the Gckfi proteins function 
in concert with Gas to control cellular cAMP levels. 

35 Gofi-l cDNA clones obtained to date are human (Bray et al . 

1987), bovine (Nukada et al . 1986) and rat (Jones and Reed 
1987). Human (Itoh et al . 1988; Weinstein et al . 1988; Beals 
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et al. 1987; Michel et al . 1986; Didsbury et al . 1987), rat 
(Jones and Reed 1987; Itoh et al . 1986) , mouse (Sullivan et al . 
1986) and bovine (Yatomi et al , 1992) Gai-2 cDNA clones have 
been isolated. Gofi-S clones include those from human (Itoh et 
5 al 1988; Seals et al . 1088;Suki et al . 1987; Kim et al . 1988) 
and rat (Itoh et al . 1988; Jones and Reed 1987) . 

It is likely that a subset of the possible amino acid 
substitutions that could be made in human Gofi could yield fully 
functional, albeit mutant, protein. It is possible that the 
10 following mutations would not alter the wild type activity of 
the protein: Ala59Asp, Glu64Asp, Aspl60Glu, Alal63Ser, 
Val332Ile. It is probable that other amino acid substitions 
not specifically cited here could be made without any 
diminishment of wild type Gofi activity. 

15 Use of Structural Models to Desicrn Chimeric or Other Mutant G 
Proteins 

Models of Gof protein structure may be used to predict 
. amino acid modifications which would not be harmful to 
activity. Analysis of Go; cDNAs and comparison to conserved 
20 sequences present in members of the GTPase superfamily has 
permitted the identification of five conserved stretches, GI- 
GS, located throughout a "composite" Go; molecule [Conklin and 
Bourne (1993) ; Bourne et al . (1991) . In addition, the location 
of putative cy-helices, (3 strands, loop domains and insertions 

2 5 have been deduced by a comparison of Go; sequences with the 

known secondary structure of p21 Thus of-helices 1-5, 

strands 1-6, loops 1-10 and inserts 1-4 have been assigned 
position in the primary Go. sequence based on comparisons with 
Ras proteins. Biochemical and genetic studies as well as 

3 0 sequence analysis have led to the delineation of a conceptual 

model of the Go. protein (Conklin and Bourne 1993). This 
conceptual model hypothesizes that while the guanine nucleotide 
binding pocket of Go. is oriented toward the cytoplasm, residues 
that interact with receptors, effectors and with the G^y 
35 complex face the plasma membrane. The model also asserts the 
following : 

(1) The N terminus of Gof is a major site for interaction with 
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the GjSy complex. 

(2) The Qf2 helix and insert 1 regions also contribute to the 
interaction of Go? with G/Sy . 

(3) At least three regions are hypothesized to interface with 
5 receptor : the amino and carboxyl termini and the conserved G5 

sequence. In the conceptual model the termini rest on the 
portion of Go; which faces the plasma membrane while the G5 
sequence sits at the "top" of the molecule. 

(4) The sequences purported to be involved in the interaction 
10 of Gof with effector molecules are envisioned to reside on the 

plasma membrane -proximal aspect of Go?. These sequences include 
the distal half of the a.2 helix, the insert 2 -loop 7 sequence 
and the insert 4 -loop 9 sequence. 

The orientation of the molecule in this conceptual model 

15 is supported in part by experimental evidence that assigns 
specific amino acids to the GTP binding site based on mutations 
which have been shown to constitutively activate Gcy by 
inhibiting the GTPase activity of the protein. The mutations 
in question are homologs of GTPase- inhibiting mutations of 

2 0 p21^^^. 

Monoclonal antibodies generated against N- terminal 
sequence cause the dissociation of the Gof^i heterotrimer ; in 
addition, N-terminally myristilated peptide inhibits the 
binding of Gofti to G/Sy in competitive fashion. Chemical cross- 

25 linking experiments indicate the close proximity of the Qf2 
helix and GjSy and a specific Gas mutation (G226A) exhibits two 
deficiencies: the Qf2 helical region does not undergo GTP- 
induced conformational change and GTP does not trigger the 
dissociation of G^y from Gokfs . The sequence denoted as the Qr2 

30 helix (analogous to the Q!2 helix of p21''^®) is more highly 
conserved than any other sequence in Go;; this strict 
conservation further supports an involvement of the helix in 
interaction with GjSy in that the formation of the heterotrimer 
underlies signalling in all G protein pathways described to 

35 date. 

Additional data has contributed to the development of the 
conceptual model of Go;. The amino and carboxyl termini of Ga 
appear to be in close proximity based on cross- linking studies 
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done using mastoparan and based on the specificity of 
monoclonal antibodies directed against Ga^i . Experimental 
evidence also suggests the proximity of the C terminus and the 
region that is analogous to the olI helix of p21^^^. Finally, 
5 insert 1, a large sequence located within loop 2, appears to 
have GAP function and folds as a domain distinct from the 
GTPase domain [Markby et al . (1993)]. 

Experimental evidence indicates that three regions of Go? 
(the N and C termini and the conserved G5 region) contact the 

10 receptor. In addition, Conklin et al . (1993) have obtained 
data which suggests that amino acid residues at the extreme C- 
terminus of Go: contribute to the specificity of receptor-G 
protein interactions. Thus chimeras constructed to replace 4-9 
residues at the extreme C- terminus of Gck?q with amino acids 

15 derived from the same region of Gofi resulted in a Go? protein 
that can transduce signal from D2 dopamine and adenosine 
receptors to phospholipase C, a Gq-specific effector. These 
receptors normally couple to Gofi . 

A glycine residue at the -3 position relative to the C- 

20 terminus is central to the formation of a ^-turn in this region 
of the Gof molecule; the jS-turn appears to be the structural 
signal that specifies interaction between receptors and a. 
subunits of the Gofi, Gofo, Goft family [Dratz et al . (1993)]. 
It has been hypothesized that the interaction between receptor 

25 and the C-terminus of Gof results in the conformational change 
that leads to the open conformation of the latter molecule, 
i.e. , the configuration in which nucleotide exchange can occur. 

Mutagenesis of Gofs implicated three regions of the 
molecule (a portion of the ql2 helix, i2-L7 and i4-L9) in the 

30 activation of adenylyl cyclase. A second series of experiments 
utilized peptides derived from Gofti to deduce the region of 
that molecule that activates phosphodiesterase; peptides 
derived from i4-<9 mimicked the ability of Gofti'^^^P to 
stimulate cGMP-phosphodiesterase . The regions identified in 

35 effector activation reside on the face of the molecule believed 
to be oriented toward the plasma membrane; in addition, one of 
the implicated sequences (the Qr2 helix) is known to undergo 
conformational change induced by GTP . 
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Early crystal structure-based models considered the 
crystal structure of the GTP-binding domain of E. coli EF-Tu 
(Jurnak 1985; LaCour et al . 1985), as well as the crystal 
structure of Ha-ras-p21 (Holbrook and Kim 1989) . Recently, a 
5 crystal structure of transducin-a (Gtof) complexed with GTPyS 
has been obtained to a resolution of 2.2 angstrom units (Noel 
et al. 1993). Analysis of this crystal structure, together 
with the biochemical and genetic data described above, has been 
used to derive generalized structure/function relationships 

10 applicable to all Gof molecules. 

In the three-dimensional structure, two domains are most 
apparent in the GtOf-GTEyS complex, each flanking a guanine 
nucleotide binding cleft. These are (1) a highly conserved 
GTPase domain and (2) a highly helical domain that is unique 

15 to heterotrimeric G proteins. The GTPase domain is 

structurally similar to the GTPase domains of p21 Ras and EF-Tu 
and consists of five of-helices surrounding a six-stranded jS- 
sheet . The other domain is highly helical, unique to 
heterotrimeric G proteins, and connected to the GTPase domain 

20 by two linker sequences. The helical and GTPase domains appear 
to enclose the GTP7S molecule and an associated Mg^^ion. This 
arrangement suggests that a conformational alteration is 
required of the Ga molecule in order for nucleotide exchange 
to occur; it is likely that conformational changes in the 

25 linker sequences initiate the movement of the helical domain 
and the opening of the molecule. 

The crystal structure permits delineation of the residues 
of Gof which interact with the triphosphate portion of the GTP 
molecule, the essential Mg^ion, and the nucleoside. In GtOf, 

3 0 the residues that contact the nucleoside and the phosphates 
form part of the helical domain and linker 2. These regions 
are implicated in receptor-regulated nucleotide exchange. Noel 
et al (1993) cite extensive interactions between Go. residues 
and guanosine; a subset of these interactions are unique to G 

3 5 proteins while others are conserved among members of the GTPase 
superfamily. The linkages between the nucleotide binding sites 
and the surface of Ga that purportedly interacts with receptor 
are also described. The authors assert that "a mechanistically 



m m 

88 

important feature of this system is the elegant manner in which 
interactions with one portion of the nucleotide support 
contacts with another. It is likely that these tightly coupled 
interactions potentiate a highly cooperative receptor-mediated 
5 disassembly of the elements that so strongly secure GDP and GTP 
in the nucleotide-binding cleft" [Noel et al (1993)]. 

Experimental data exists which implicates specific ot- 
helices and ^ sheets (Qf2/jS4, a3/(35, a4/j86) in effector binding 
and activation. These regions were found to form a series of 

10 surface loops in the three-dimensional model derived from 
analysis of the Gtof-GTPyS crystals. Work done with Gta 
suggests interaction of these loops with phosphodiesterase or 
with the inhibitory y-subunits of that enzyme. Studies 
accomplished using Gsof/Giof chimeras suggest that these surface 

15 loops play a role in the regulation of adenylyl cyclase. In 
addition, the crystal structure of Gtof-CTPyS indicates how GTP 
may effect conformational change in these effector- interactive 
loops. Glycine residues in the a/y 2 helix interact with the 
7 -phosphate of GTP and are believed to be the source of the 

2 0 malleability required for the conformational change which 

occurs upon hydrolsis of GTP.. The GDP/GTP- induced changes in 
Qf2 are hypothesized to transmit to the a3 and q;4 loops through 
a connecting series of interhelical contacts , thus linking 
changes in the interaction of y-phosphate with a2 to the 
25 effector-binding surface loops. 

The crystal structure draws attention to two residues that 
appear to play a role in the hydrolysis of GTP. A conserved 
arginine residue (Arg 174) contacts the y-phosphate directly 
and may facilitate its release upon hydrolysis. Mutation of 

3 0 the cognate arginine in Gso; and Giof severely compromises GTPase 

activity and results in a const itutively active Ga . The 
structure also suggests the glutamine at position 203 as the 
initiator of the hydrolytic attack on the y-phosphate. Glu203 
appears to be appropriately oriented to activate a water 
35 molecule well -positioned for nucleophilic attack on the y- 
phosphate. This glutamate resides in the q?2 helix and is 
conserved among the family of Gof subunits . 

As indicated by the foregoing models of Go; structure, the 
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function of the molecule is dependent on its interaction with 
receptor, the jSy complex, GTP or GDP, and effector molecules. 
Mutation of residues that experimental evidence or crystal 
structure-derived data have indicated as contributing to these 
5 numerous interactions could compromise Ga function. The 
following sequences, residues and domains have been shown to 
be particularly important to Gof function: N- terminal residues, 
residues at the extreme C-terminus (particularly the glycine 
at position -3) , the highly conserved olI helix, cognates of Arg 

10 174 of Goft, cognates of glutamine 203 of Gckft , the GTPase 
domain, and the Qf2/j84, a3 / ^3 , and ot^/^S regions. Other 
sequences would appear to be important, based on their 
conservation among members of the GTPase family or in that they 
are unique to heterotrimeric G protein of-subunits. These 

15 include the conserved sequences G1-G5 and the regions 
identified as inserts through comparisons made between Go? and' 
Ras proteins. It must be stated that a definitive citation of 
functionally important regions cannot be made as this remains 
an intense area of research. 

20 It is expected that future studies will identify the 

residues in the larger domains cited above that contribute most 
to function and identify additional specific residues whose 
characteristics are central to Go? function. Although 
conservative mutations in important regions of Go. may leave the 

25 function of the molecule intact, the more radical the change, 
the higher the likelihood of interference with protein 
function. The models as outlined above underscore an important 
feature of the Ga molecule. Conformational change is inherent 
to the exchange of nucleotide and that exchange is central to 

3 0 Got function. The conformational change appears to occur as a 
wave of signal transmission from one domain of the molecule to 
another. The models thus emphasize that alterations in any one 
of the functional domains of the protein can affect the final 
transduction of signal, i.e., the function of the molecule 

35 depends on successful cooperativity of several domains. 

It must be stated that mutations can be made, however, 
which can contribute to the utility of the Go. protein in 
experimental systems. As an example, mutations which 
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compromise the GTPase function specifically, without affecting 
Gof interaction with effector proteins, results in a 
constitutively active protein. In a subset of experimental 
scenarios, a constitutively active Ga is a desirable molecular 
5 reagent . 

It may be possible to make the following conservative 
amino acid substitutions in the sequence of human Gofs without 
compromising the wild-type activity of the protein: Ilel83Leu, 
Aspl84Glu, Leul98Val, Val218Leu, and Ile373Val. It is likely 
10 that other conservative amino acid substitutions not 
specifically cited here may be made in the sequence of Gq;s 
without inducing significant change in the activity of the wild 
type protein. 

Mammalian G3 and Gy Subunits 

15 As of early 1994, at least four mammalian Gj8 subunits were 

known and had been cloned. Both human and bovine clones of GjSl 
(Codina J. et al . (1986) FEES Lett. 207, 187-192; Sugimoto K. 
et al. (1985) FEES Lett. 191, 235-240; Fong H.K.W. et al . 
(1986) Proc. Natl. Acad. Sci. USA 83, 2162-2166) and G/?2 (Fong 

20 H.K.W. et al. (1987) Proc. Natl. Acad. Sci. USA 84, 3792-3796; 
Gao B. et al . (1987) Proc. Natl. Acad. Sci. USA 84, 6122-6125) 
have been isolated. A human GiS3 (Levine M.A. et al . (1990) 
Proc. Natl. Acad. Sci. USA 87, 2329-2333) and mouse G(34: {Van 
Weizsacker E. et al . (1992) Eiochem. Eiophys . Res. Commun . 183, 

25 350-356) have also been cloned. Five complete mammalian Gy 
subunits have been cloned: bovine Gyl (Hurley J.E. (1985) Proc. 
Natl. Acad. Sci. 81, 6948-6952), bovine G72 (Robishaw J.D. 
(1989) J. Eiol. Chem. 264, 15758-15761), bovine G73 (Call J.J 
et al. (1992) J. Biol. Chem. 267, 24023-24027), bovine and rat 

30 G75 (Fisher K.J. and Aronson N.N. (1992) Mol . Cell. Eiol. 12, 
1585-1591), and bovine G77 (Call J.J. et al. (1992) J. Biol. 
Chem. 267, 24023-24027) . Part of a sixth Gy subuhit, G74 , has 
been isolated (Gautam N. et al . (1990) Proc. Natl. Acad. Sci. 
USA 87, 7973-7977) . 

3 5 Various combinations of GjS and Gy have been observed in 

vitro and therefore have the potential to be physiologically 
active. For example, GjSl can dimerize with G7I, G72 , G73> and 
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G75 (Ifiiguez-Lluhi J. A. (1992) J. Biol. Chem. 267, 23409- 
23417), and jSlyl, iSlY2, and 18175 stimulate phosphoinosit ide 
hydrolysis by phospholipase C (32 (Katz A. et al . (1992) Nature 
360, 686-688). Other combinations are not observed. For 
5 example, G^2 does not dimerize with G7I (Schmidt C.J. et al . 
(1992) J. Biol. Chem. 267, 13807-13810; ProninA.N. and Gautam 
N. (1992) Proc. Natl. Acad. Sci. USA 89, 6220-6224). The 
effects of jSy dimers on adenylyl cyclases depends on both the 
isoform of adenylyl cyclase and the particular jSy dimer in 
10 question. While type 1 adenylyl cyclase is inhibited to 
varying degrees by different /3y dimers, the same dimers will 
potentiate the stimulatory effect of Gofs on type 2 adenylyl 
cyclase (Iniguez-Lluhi J. A. (1992) J. Biol. Chem. 267, 23409- 
3 23417). In both cases the potencies of ^8172, 18173, (32y2 , and 

15 P2y3 are reported to be equivalent and greater than that of 
15 iSlTl. 

Id 

12 The following additional references may be of value: 

s GbetaS : 

IZ A fifth member of the mammalian G-protein beta-subunit 

fij 20 family. Expression in brain and activation of the beta 2 

isotype of phospholipase C. Watson AJ, Katz A Simon MI, 19994, 

J Biol Chem, 269:22150-6. 

GgammaS : 

A novel GTP-binding protein gamma - subuni t , GgammaS, is 

2 5 expressed during neurogenesis in the olfactory and vomeronasal 

neruoepithelia . Ryba NJP, Tirindelli R, J Biol Chem, 270:6757- 
6767, 1995. 

Ggamma7 : 

Selective Tissue Distribution of G protein gamma subunits, 

3 0 Including a new form of the gamma subunits identified by cDNA 

Cloning. Call JJ, Balcueva EA, Rybalkin I, and Robishaw JD, 
J Biol Chem, 267:24023-24027, 1992. 



Ggamma6 : 

Existence of -two gamma subunits of the G proteins in- 
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brain. J.D. Robishaw, V.K. Kalman, C.R. Moomaw, C.A. 
Slaughter, J Biol Chem, 264:15758-15761, 1989. 

Mutant G^S and Gy subunits may be designed in a manner 
analogous to that set forth with respect to Go? subunits. 

5 Combinatorial Libraries 

The term "library" generally refers to a collection of 
chemical or biological entities which are related in origin, 
structure, and/or function, and which can be screened 
simultaneously for a property of interest. 

10 The term "combinatorial library" refers to a library in 

which the individual members are either systematic or random 
combinations of a limited set of basic elements, the properties 
of each member being dependent on the choice and location of 
the elements incorporated into it.' Typically, the members of 

15 the library are at least capable of being screened 
simultaneously. Randomization may be complete or partial; some 
positions may be randomized and others predetermined, and at 
random positions, the choices may be limited in a predetermined 
manner. The members of a combinatorial library may be 

20 oligomers or polymers of some kind, in which the variation 
occurs through the choice of monomeric building block at one 
or more positions of the oligomer or polymer, and possibly in 
terms of the connecting linkage, or the length of the oligomer 
or polymer, too. Or the members may be nonoligomeric molecules 

25 with a standard core structure, like the 1 , 4 -benzodiazepine 
structure, with the variation being introduced by the choice 
of substituents at particular variable sites on the core 
structure. Or the members may be nonoligomeric molecules 
assembled like a jigsaw puzzle, but wherein each piece has both 

3 0 one or more variable moieties (contributing to library 
diversity) and one or more constant moieties (providing the 
functionalities for coupling the piece in question to other 
pieces) . 

The ability of one or more members of such a library to 
35 recognize a target molecule is termed "Combinatorial 
Recognition". In a "simple combinatorial library", all of the 
members, belong to the same class of compounds (e.g., peptides) 
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and can be synthesized simultaneously. A "composite 

combinatorial library" is a mixture of two or more simple 
libraries, e.g., DNAs and peptides, or benzodiazepine and 
carbamates. The number of component simple libraries in a 
5 composite library will, of course, normally be smaller than the 
average number of members in each simple library, as otherwise 
the advantage of a library over individual synthesis is small. 

Oligonucleotide Libraries 

An oligonucleotide library is a combinatorial library, at 

10 least some of whose members are single-stranded 
oligonucleotides having three or more nucleotides connected by 
phosphodiester or analogous bonds. The oligonucleotides may 
be linear, cyclic or branched, and may include non-nucleic acid 
moieties. The nucleotides are not limited to the nucleotides 

15 normally found in DNA or RNA. For examples of nucleotides 
modified to increase nuclease resistance and chemical stability 
of aptamers, see Chart 1 in Osborne and Ellington, Chem. Rev., 
97: 349-70 (1997). For screening of RNA, see Ellington and 
Szostak, Nature, 346: 818-22 (1990). 

2 0 There is no formal minimum or maximum size for these 

oligonucleotides. However, the number of conformations which 
an oligonucleotide can assume increases exponentially with its 
length in bases. Hence, a longer oligonucleotide is more 
likely to be able to fold to adapt itself to a protein surface. 
25 On the other hand, while very long molecules can be synthesized 
and screened, unless they provide a much superior affinity to 
that of shorter molecules, they are not likely to be found in 
the selected population, for the reasons explained by Osborne 
and Ellington (1997) . Hence, the libraries of the present 

3 0 invention are preferably composed of oligonucleotides having 

a length of 3 to 100 bases, more preferably 15 to 35 bases. 
The oligonucleotides in a given library may be of the same or 
of different lengths . 

Oligonucleotide libraries have the advantage that 
35 libraries of very high diversity (e.g., 10^^) are feasible, and 
binding molecules are readily amplified in vitro by polymerase 
chain reaction (PGR) . Moreover, nucleic acid molecules can 
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have very high specificity and affinity to targets. 

In a preferred embodiment, this invention prepares and 
screens oligonucleotide libraries by the SELEX method, as 
described in King and Famulok, Molec . Biol. Repts., 20: 97-107 
5 (1994); L. Gold, C. Tuerk. Methods of producing nucleic acid 
ligands, US#5595877; Oliphant et al . Gene 44:177 (1986). 

The term "aptamer" is conferred on those oligonucleotides 
which bind the target protein. Such aptamers may be used to 
characterize the target protein, both directly (through 
10 identification of the aptamer and the points of contact between 
the aptamer and the protein) and indirectly (by use of the 
aptamer as a ligand to modify the chemical reactivity of the 
protein) . 

Peptide Library 

15 A peptide library is a combinatorial library, at least 

some of whose members are peptides having three or more amino 
acids connected via peptide bonds. Preferably, they are at 
least five, six, seven or eight, amino acids in length. 
Preferably, they are composed of less than 50, more preferably 

20 less than 20 amino acids. 

The peptides may be linear, branched, or cyclic, and may 
include nonpeptidyl moieties. The amino acids are not limited 
to the naturally occurring amino acids. 

A biased peptide library is one in which one or more (but 

25 not all) residues of the peptides are constant residues. The 
individual members are referred to as peptide ligands (PL) . 
In one embodiment, an internal residue is constant, so that the 
peptide sequence may be written as 

(X,J,-AA,- (X^J, 

30 Where Xaa is either any naturally occurring amino acid, 

or any amino acid except cysteine, m and n are chosen 
independent ly from the range of 2 to 2 0, the Xaa may be the 
same or different, and AA-^ is the same naturally occurring 
amino acid for all peptides in the library but may be any amino 

35 acid. Preferably, m and n are chosen independently from the 
range of 4 to 9 . 

Preferably, AA-^ is located at or near the center of the 
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peptide. More specifically, it is desirable that m and n are 
not different by more than 2; more preferably m and n are 
equal. Even if the chosen AA-l is required (or at least 
permissive) of the target protein (TP) binding activity, one 
5 may need particular flanking residues to assure that it is 
properly positioned. If AA^ is more or less centrally located, 
the library presents numerous alternative choices for the 
flanking residues. If AAi is at an end, this flexibility is 
diminished . 

10 The most preferred libraries are those in which AA^ is 

tryptophan, proline or tyrosine. Second most preferred are 
those in which AA^ is phenylalanine, histidine, arginine, 
aspartate, leucine or isoleucine. Third most preferred are 
those in which AA-^ is asparagine, serine, alanine or 

15 methionine. The least preferred choices are cysteine and 
glycine. ,These preferences are based on evaluation of the 
results of screening random peptide libraries for binding to 
many different TPs. 

Ligands that bind to functional domains tend to have both 

20 constant as well as unique features. Therefore, by using 
"biased" peptide libraries, one can ease the burden of finding 
ligands. Either "biased" or "unbiased" libraries may be 
screened to identify "BioKey" peptides for use in developing 
reactivity descriptors , and, optionally, peptide aptamer 

25 descriptors and additional drug leads. 

Studies of Orphan Receptors 

Orphan receptors have been identified by virtue of their 
sequence similarity to known non-orphan receptors, however, by 
definition, they do not have known natural ligands. 

30 The first step in seeking to predict an orphan receptor- 

mediated biological activity of a compound is to identify at 
least one pharmacological agonist or antagonist of the orphan 
receptor. (Once such a compound is identified, the receptor 
is not longer strictly speaking an "orphan".) This ligand, 

35 which need not be a natural ligand of the receptor, is then 
used as a reference ligand to define a Biokey panel, etc. 

To identify an agonist or antagonist, a combinatorial 
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library is first screened for members which bind the receptor. 
Preferably, at least five, more preferably at least ten, 
distinct members are identified. Preferably, it should be 
demonstrable from competition experiments that more than one 
5 binding site is involved. 

Compounds are then screened for the ability to inhibit the 
binding of one or more of the aforementioned library members 
to the orphan receptor. Those which do so are likely to have 
altered the receptor conformation. These putative ligands are 

10 then screened for agonist or antagonist activity. The 
biological activities examined preferably include the 
activities native to those of the cognate receptors by 
reference to which the orphan receptors were originally 
identified. They also preferably include assays for cell 

15 proliferation for each cell type in which said orphan receptor 
is known (by detection of the receptor or its corresponding 
mRNA) to be expressed. 

The screened compounds may be small organic compounds, 
such as compounds from a suitable combinatorial or 

20 noncombinatorial library, or they may come from natural 
sources, such as serum, urine, cerebrospinal fluid, lymphatic 
fluid, or tissue extracts, which might harbor the natural 
ligand. Optionally, these natural source materials may be 
fractionated by conventional methods, and each fraction tested: 

25 The compounds may be known agonists or antagonists (or 
•analogues thereof) of the cognate receptor, but need not be. 

Small Organic Compound Library 

The small organic compound library ("compound library", 

for short) is a combinatorial library whose members are 
30 suitable for use as drugs if, indeed, they have the ability to 

mediate a biological activity of the target protein. 

Peptides have certain disadvantages as drugs. These 

include susceptibility to degradation by serum proteases, and 

difficulty in penetrating cell membranes. Preferably, all or 
35 most of the compounds of the compound library avoid, or at 

least do not suffer to the same degree, one or more of the 

pharmaceutical disadvantages of peptides. 
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In designing a compound library, it is helpful to bear in 
mind the methods of molecular modification typically used to 
obtain new drugs. Three basic kinds of modification may be 
identified: disjunction , in which a lead drug is simplified to 
5 identify its component pharmacophoric moieties; conjunction , 
in which two or more known pharmacophoric moieties, which may 
be the same or different, are associated, covalently or 
noncovalently , to form a new drug; and alteration , in which one 
moiety is replaced by another which may be similar or 

10 different, but which is not in effect a disjunction or 
conjunction. The use of the terms "disjunction", "conjunction" 
and "alteration" is intended only to connote the structural 
relationship of the end. product to the original leads, and not 
how the new drugs are actually synthesized, although it is 

15 possible that the two are the same. 

The process of disjunction is illustrated by the evolution 
of neostigmine (1931) and edrophonium (1952) from physost igmine 
(1925) , Subsequent conjunction is illustrated by demecarium 
(1956) and ambenonium (1956) . 

20 Alterations may modify the size, polarity, or electron 

distribution of an original moiety. Alterations include ring 
closing or opening, formation of lower or higher homologues, 
introduction or saturation of double bands, introduction of 
optically active centers, introduction, removal or replacement 

25 of bulky groups, isosteric or bioisosteric substitution, 
changes in the position or orientation of a group, introduction 
of alkylating groups, and introduction, removal or replacement 
of groups with a view toward inhibiting or promoting inductive 
(electrostatic or conjugative (resonance) effects. 

3 0 Thus, the substituents may include electron acceptors 

and/or electron donors. Typical electron donors (+1) include 
-CH3, -CH2R, -CHR2, -CR3 and -COO". Typical electron acceptors 
(-1) include -NH3+, -NR3+, -NO2, -CN, -COOH, -COOR, -CHO, -COR, 
-COR, -F, -CI, -Br, -OH, -OR, -SH, -SR, -CH=CH2, -CR=CR2, and 

35 -C=CH. 

The substituents may also include those which increase or 
decrease electronic density in conjugated systems. The former 
(+R) groups include -CH3, -CR3, -F, -CI, -Br, -I, -OH, -OR, 
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-OCOR, -SH, -SR, -NH2, -NR2, . and -NHCOR. The later (-R) 
groups include -NO2, -CN, -CHC, -COR, -COOH, -COOR, -CONH2, - 
SO2R and -CF3. 

Synthetically speaking, the modifications may be achieved 
5 by a variety of unit processes, including nucleophilic and 
electrophilic substitution, reduction and oxidation, addition 
elimination, double bond cleavage, and cyclization. 

For the purpose of constructing a library, a compound, or 
a family of compounds, having one or more pharmacological 

10 activities (which need not be related to the known or suspected 
activities of the target protein) , may be disjoined into two 
or more known or potential pharmacophoric moieties. Analogues 
of each of these moieties may be identified, and mixtures of 
these analogues reacted so as to reassemble compounds which 

15 have some similarity to the original lead compound. It is not 
necessary that all members of the library possess moieties 
analogous to all of the moieties of the lead compound. 

The design of a library may be illustrated by the example 
of the benzodiazepines. Several benzodiazepine drugs, 

2 0 including chlordiazepoxide , diazepam and oxazepam, have' been 
used on anti-anxiety drugs. Derivatives of benzodiazepines 
have widespread biological activities; derivatives have been 
reported to act not only as anxiolytics, but also as 
anticonvulsants, cholecystokinin (CCK) receptor subtype A or 

25 B, kappa opioid receptor, platelet activating factor, and HIV 
transactivator Tat antagonists, and GPIIblla, reverse 
transcriptase and ras farnesyltransf erase inhibitors. 

The benzodiazepine structure has been disjoined into a 2- 
aminobenzophenone , an amino acid, and an alkylating agent. See 

30 Bunin, et al . , Proc . Nat. Acad. Sci. USA, 91:4708 (1994). 
Since only a few 2 -aminobenzophenone derivatives are 
commercially available, it was later disjoined into 2- 
aminoarylstannane, an acid chloride, an amino acid, and an 
alkylating agent. Bunin, et al . , Meth. Enzymol . , 267:448 

35 (1996) . The arylstannane may be considered the core structure 
upon which the other moieties are substituted, or all four may 
be considered equals which are conjoined to make each library 
member . 
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A basic library synthesis plan and member structure is 
shown in Figure 1 of Fowlkes, et al . , U,S. Serial No. 
08/740,671, incorporated by reference in its entirety. The 
acid chloride building block introduces variability at the 
5 site. The site is introduced by the amino acid, and the R^ 
site by the alkylating agent. The R^ site is inherent in the 
arylstannane . Bunin, et al. generated a 1, 4 -benzodiazepine 
library of 11,200 different derivatives prepared from 20 acid 
chlorides, 35 amino acids, and 16 alkylating agents. (No 

10 diversity was introduced at R'^; this group was used to couple 
the molecule to a solid phase.) According to the Available 
Chemicals Directory (HDL Information Systems, San Leandro CA) , 
over 300 acid chlorides, 80 Fmoc-protected amino acids and 800 
alkylating agents were available for purchase (and more, of 

15 course, could be synthesized). The particular moieties used 
were chosen to maximize structural dispersion, while limiting 
the numbers to those conveniently synthesized in the wells of 
a microtiter plate. In choosing between structurally similar 
compounds, preference was given to the least substituted 

2 0 compound . 

. The variable elements included both aliphatic and aromatic 
groups. Among the aliphatic groups, both acyclic and cyclic 
(mono- or poly-) structures, substituted or not, were tested. 
(While all of the acyclic groups were linear, it would have 
25 been feasible to introduce a branched aliphatic) . The aromatic 
groups featured either single and multiple rings, fused or not, 
substituted or not, and with heteroatoms or not. The secondary 
substitutents included -NH2, -OH, -OMe, -CN, -CI, -F, and - 
COOH. While not used, spacer moieties, such as -0-, -S-, -00-, 

3 0 -CS-, -NH-, and -NR- , could have been incorporated. 

Bunin et al . suggest that instead of using a 1, 4- 
benzodiazepine as a core structure, one may instead use a 1, 
4 -benzodiazepine -2 , 5-dione structure . 

As noted by Bunin et al . , it is advantageous, although not 
35 necessary, to use a linkage strategy which leaves no trace of 
the linking functionality, as this permits construction of a 
more diverse library. 

Other combinatorial nonoligomeric compound libraries known 
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or suggested in the art have been based on carbamates, 
mercaptoacylated pyrrolidines, phenolic agents, aminimides, N- 
acylamino ethers (made from amino alcohols, aromatic hydroxy 
acids, and carboxylic acids) , N-alkylamino ethers (made from 
5 aromatic hydroxy acids, amino alcohols and aldehydes) 1, 4- 
piperazines, and 1, 4-piperazine-6-ones . 

DeWitt, et al . , Proc . Nat. Acad. Sci. (USA), 90:6909-13 
(1993) describes the simultaneous but separate, synthesis of 
40 discrete hydantoins and 40 discrete benzodiazepines. They 

10 carry out their synthesis on a solid support (inside a gas 
dispersion tube) , in an array format, as opposed to other 
conventional simultaneous synthesis techniques (e.g., in a 
well, or on a pin) . The hydantoins were synthesized by first 
simultaneously deprotecting and then treating each of five 

15 amino acid resins with each of eight isocyanates. The 
benzodiazepines were synthesized by treating each of five 
deprotected amino acid resins with each of eight 2 -amino 
benzophenone imines . 

Chen, et al . , J. Am. Chem. Soc . , 116:2661-62 (1994) 

20 described the preparation of a pilot (9 member) combinatorial 
library of formate esters. A polymer bead-bound aldehycie 
preparation was "split" into three aliquots, each reacted with 
one of three different ylide reagents. The reaction products 
were combined, and then divided into three new aliquots, each 

25 of which was reacted with a different Michael donor. Compound 
identity was found to be determinable on a single bead basis 
by gas chromatography/mass spectroscopy analysis. 

Holmes, USP 5,549,974 (1996) sets forth methodologies for 
the combinatorial synthesis of libraries of thiazolidinones and 

30 metathiazanones . These libraries are made by combination of 
amines, carbonyl compounds, and thiols under cyclization 
conditions . 

Ellman, USP 5,545,568 (1996) describes combinatorial 
synthesis of benzodiazepines, prostaglandins, beta- turn 
35 mimetics, and glycerol-based compounds. See also Ellman, USP 
5, 288, 514 . 

Summerton, USP 5,506,337 (1996) discloses methods of 
preparing a combinatorial library formed predominantly of 
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morpholino subunit structures. 

Heterocylic combinatorial libraries are reviewed generally 
in Nefzi, et al . , Chem. Rev., 97:449-472 (1997). One or more 
moieties of the following types may be incorporated into 
5 compounds of the library, as many drugs fall into one or more 
of the following categories: 

acetals 

acids 

alcohols 
10 amides 

amidines 
\l amines 
fy amino acids 

IlJ amino alcohols 

15 amino ethers 

14, amino ketenes 

f~l ammonium compounds 

'^Z. azo compounds 

'^^"^ enols 
20 esters 

ethers 

glycosides 

guanidines 

halogenated compounds 

2 5 hydrocarbons 

ketones 
lactams 

lactones ^ 
mustards 

3 0 nitro compounds 
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nitroso compounds 
organo minerals 
phenones 
quinones 
5 semicarbazones 
stilbenes 
sulfonamides 
sulfones 
thiols 
10 thioamides 
thioureas 
ureas 
ureides 
urethans 

15 Without attempting to exhaustively recite all 

pharmacological classes of drugs, or all drug structures, one 
or more compounds of the chemical structures listed below have 
been found to exhibit the indicated pharmacological activity, 
and these structures , or derivatives , may be used as design 
2 0 elements in screening for further compounds of the same or 
different activity. (In some cases, one or more lead drugs of 
the class are indicated.) 
hypnotics 

higher alcohols (clomethiazole) 
2 5 aldehydes (chloral hydrate) 

carbamates (meprobamate) 
acyclic ureides (acetylcarbromal ) 
barbiturates (barbital ) 
benzodiazepine (diazepam) 
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anticonvulsants 

barbiturates (phenobarbital ) 
hydantoins (phenytoin) 
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oxazolidinediones ( trimethadione) 
succinimides (phensuximide ) 
acylureides (phenacemides ) 



narcotic analgesics 
5 morphines 

phenylpiperidines (meperidine ) 
diphenylpropylamines (methadone ) 
phenothiazihes (methotrimeprazine) 

analgesics , antipyretics , antirheumatics 
10 salicylates (acetylsalicylic acid') 

p-aminophenol (acetaminophen) 
5 -pyrazolone (dipyrone ) 

3, 5-pyrazolidinedione (phenylbutazone) 
arylacetic acid (indomethacin) 
15 adrenocortical steroids (cortisone, dexamethasone , 

prednisone , triamcilone) 
athranilic acids 

neuroleptics 

phenothiazine (chlorpromazine) 
20 thioxanthene (chlorprothixene) 

reserpine 

butyrophenone (halopendol ) 

anxiolytics 

propandiol carbamates (meprobamate) 
25 benzodiazepines (chlordiazepoxide , diazepam, 

oxazepam) 

ant idipressants 

tricyclics ( imipramine) 

muscle /relaxants 
3 0 propanediols and carbamates (mephenesin) 



CNS stimulants 
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xanthines (caffeine, theophylline) 
phenylalkylamines (amphetamine) 

(Fenetylline is a conjunction of theophylline and 
amphetamine ) 
5 oxazolidinones (pemoline) 

cholinergics 

choline esters (acetylcholine) 
N, N-dimethylcarbamates 

adrenergics 

10 aromatic amines (epinephrine, isoproterenol, 

phenylephrine ) 
alicyclic amines (cyclopentamine) 
\2 aliphatic amines (methylhexaneamine) 

l^. imidazolines (naphazoline) 

15 anti- adrenergics 

M indole thylamine alkaloids (dihydroergot amine) 

f imidazoles (tolazoline) 

f 3 benzodioxans (piperoxan) 

ry beta -haloa Iky lamines (phenoxybenz amine) 

'^Z 20 dibenzazepines (azapetine) 

s5 hydrazinophthalazines ^hydralazine) 

antihistamines ^ 
ethanolamines (diphenhydramine) 
ethylenediamines ( tripelennomine) 

2 5 alkylamines ( chlorpheniramine ) 

piperazines (cyclizine) 
phenothiazines (promethazine) 

local anesthetics 
benzoic acid 

3 0 esters (procaine , isobucaine , cyclomethycaine ) 

basic amides (dibucaine) 

anil ides , toluidides , 2 , 6 -xylidides ( lidocaine) 
tertiary amides (oxetacaine) 
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vasodilators 

polyol nitrates (nitroglycerin) 

diuretics 

xanthines 

5 thiazides (chlorothiazide) 

sulfonamides (chlorthalidone) 

antihelmintics 

cyanine dyes 

antimalarials 
10 4 -aminoquinolines 

8 -aminoquinolines 

pyrimidines 

biguanides 

acridines 
15 dihydrotriazines 

sulfonamides 

sulf ones 

antibacterials 

antibiotics 
20 penicillins 

cephalosporins 

octahydronapthacenes (tetracycline) 
sulfonamides 
nitrof urans 

2 5 eye lie amine s 

naphthyridines 
xylenols 

antitumor 

alkylating agents 

3 0 nitrogen mustards 

aziridines 

methanesulf onate esters 
epoxides 
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amino acid antagonists 
folic acid antagonists 
pyrimidine antagonists 
purine antagonists 



5 antiviral 

adamantanes 
nucleosides 
thiosemicarbazones 
inosines 

10 amidines and guanidines 

isoquinolines 
benz imidazoles 
piperazines 

For pharmacological classes, see, e.g., Goth, Medical 
15 Pharmacology: Principles and Concepts (C.V. Mosby Co. : 8th ed. 
1976); Korolkovas and Burckhalter, Essentials of Medicinal 
Chemistry (John Wiley & Sons, Inc.: 1976). For synthetic 
methods, see, e.g.. Warren, Organic Synthesis: The 
Disconnection Approach (John Wiley & Sons, Ltd. : 19 82) ; Fuson, 
2 0 Reactions of Organic Compounds (John Wiley & Sons: 1966) ; Payne 
and Payne, How to do an Organic Synthesis (Allyn and Bacon, 
Inc. : 1969) ; Greene, Protective Groups in Organic Synthesis 
(Wiley- Interscience) . For selection of substituents , see e.g. , 
Hansch and Leo, Substituent Constants for Correlation Analysis 
25 in Chemistry and Biology (John Wiley & Sons: 1979) . 

The library is preferably synthesized so that the 
individual members remain identifiable so that, if a member is 
shown to be active, it is not necessary to analyze it. Several 
methods of identification have been proposed, including: 
30 (1) encoding, i.e., the attachment to each member of an 

identifier moiety which is more readily identified 
than the member proper. This has the disadvantage 
that the tag may itself influence the activity of 
the con j ugate . 

35 (2) spatial addressing, e.g., each member is synthesized 

only at a particular coordinate on or in a matrix, 
or in a particular chamber. This might be, for 
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example, the location of a particular pin, or a 
particular well on a microtiter plate, or inside a 
"tea bag" . 

The present invention is not limited to any particular form of 
5 identification. 

However, it is possible to simply characterize those 
members of the library which are found to be active, based on 
the characteristic spectroscopic indicia of the various 
building blocks. 

10 Solid phase synthesis permits greater control over which 

derivatives are formed. However, the solid phase could 
interfere with activity. To overcome this problem, some or all 
of the molecules of each member could be liberated, after 
synthesis but before screening. 
15 Examples of candidate simple libraries which might be 

evaluated include derivatives of the following: 
Cyclic Compounds Containing One Hetero Atom 
Heteronitrogen 
pyrroles 

20 pentasubstituted pyrroles 

pyrrolidines 
pyrrolines 
prolines 
indoles 

2 5 beta-carbolines 

pyridines 

dihydropyridines 

1 , 4 -dihydropyridines 

pyrido [2 , 3-d] pyrimidines 
30 tetrahydro-3H-imidazo [4 , 5-c] pyridines 

Isoquinolines 

tetrahydroisoquinolines 
quinolones 
beta- lactams 

35 azabicyclo [4 . 3 . 0] nonen- 8 -one amino acid 

Heterooxygen 
f urans 

tetrahydrof urans 
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2 , 5-disubstituted tetrahydrof urans 

pyrans 

hydr oxypyr anone s 
t e t rahydr oxypyr anone s 
5 gamma -butyrolactones 

Heterosulf ur 

sulf olenes 

Cyclic Compounds with Two or More Hetero atoms 
Multiple heteronitrogens 
10 imidazoles 

pyrazoles 
piperazines 

diketopiperazines 
arylpiperazines 
15 benzylpiperazines 

benzodiazepines 
1, 4 -benzodiazepine-2 , 5-diones 
hydantoins 

5 -alkoxyhydantoins 

2 0 dihydropyrimidines 

1 , 3 -disubstituted-5 , 6 -dihydopyrimidine-2 , 4 - 

diones 
cyclic ureas 
cyclic thioureas 
25 quinazolines 

chiral3 -substituted-quinazoline-2 , 4 -diones 
triazotes 

1,2, 3 - triazoles 
purines 

3 0 Heteronitrogen and Heterooxygen 

dikelomorpholines 
isoxazoles 
isoxazo lines 
Heteronitrogen and Heterosulfur 
35 thiazolidines 

N-axylthiazoli dines 
dihydrothiazoles 
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2-methylene-2 , 3 -dihydrothiazates 
2-aminothiazoles 
thiophenes 

3 - amino thiophenes 
5 4 - thiazolidinones 

4 -melathiazanones 
benzisothiazolones 
For details on synthesis of libraries, see Nefzi, et al . , 
Chem. Rev., 97:449-72 (1997), and references cited therein. 

10 Amino Acids and Peptides 

Amino acids are the basic building blocks with which 
peptides and proteins are constructed. Amino acids possess 
both an amino group (-NH2) and a carboxylic acid group (-COOH) . 
Many amino acids, but not all, have the structure NH2-CHR-COOH, 

15 where R is hydrogen, or any of a variety of functional groups. 

Twenty amino acids are genetically encoded: Alanine, 
Arginine, Asparagine, Aspartic Acid, Cysteine, Glutamic Acid, 
Glutamine, Glycine, Histidine, Isoleucine, Leucine, Lysine, 
Methionine, Phenylalanine, Proline, Serine, Threonine, 

20 Tryptophan, Tyrosine, and Valine. Of these, all save Glycine 
are optically isomeric, however, only the L-form is found in 
humans. Nevertheless, the D-forms of these amino acids do have 
biological significance; D-Phe, for example, is a known 
analgesic . 

25 Many other amino acids are also known, including: 2- 

Aminoadipic acid; 3 -Aminoadipic acid; beta-Aminopropionic acid; 
2-Aminobutyric acid; 4 -Aminobutyric acid (Piperidinic acid) ; 
6-Aminocaproic acid; 2 -Aminoheptanoic acid; 2 -Aminoisobutyric 
acid, 3 -Aminoisobutyric acid; 2 -Aminopimelic acid; 

30 2 , 4 -Diaminobutyric acid; Desmosine; 2 , 2 ' -Diaminopimelic acid; 
2 , 3 -Diaminopropionic acid; N-Ethylglycine ; N-Ethylasparagine ; 
Hydroxylysine ; allo-Hydroxylysine ; 3 -Hydroxyproline ; 
4 -Hydroxyproline ; Isodesmosine ; alio- Isoleucine ; N- 

Methylglycine (Sarcosine) ; N-Methylisoleucine ; N-Methylvaline ; 

3 5 Norvaline; Nor leucine; and Ornithine. 

Peptides are constructed by condensation of amino acids 
and/or smaller peptides. The amino group of one amino acid (or 
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peptide) reacts with the carboxylic acid group of a second 
amino acid (or peptide) to form a peptide (-NHCO-) bond, 
releasing one molecule of water. Therefore, when an amino acid 
is incorporated into a peptide, it should, technically 
5 speaking, be referred to as an amino acid residue. 

A peptide is composed of a plurality of amino acid 
residues joined together by peptidyl (-NHCO-) bonds. A 
biogenic peptide is a peptide in which the residues are all 
genetically encoded amino acid residues; it is not necessary 
10 that the biogenic peptide actually be produced by gene 
expression . 

The peptides of the present invention include peptides 
whose sequences are disclosed in this specification, or 
sequences differing from the above solely by no more than one 

15 nonconservative substitution and/or one or more conservative 
substitutions, preferably no more than a single conservative 
substitution. The substitutions may be of non-genetically 
encoded (exotic) amino acids, in which case the resulting 
peptide is nonbiogenic. Preferably, the peptides are biogenic. 

20 If the peptide is being expressed in a cell, all of its 

amino acids must be biogenic (unless the cell is engineered to 
alter certain amino acids post -expression, or the peptide is 
recovered and modified in vitro) . If it is produced 

nonbiologically (e.g., Merrif ield- type synthesis) or by 

25 semisynthesis , it may include nonbiogenic amino acids. 

Additional peptides within the present invention may be 
identified by systematic mutagenesis of the lead peptides, e.g. 

(a) separate synthesis of all possible single 
substitution (especially of genetically encoded AAs) 

3 0 mutants of each lead peptide, and/or 

(b) simultaneous binomial random alanine-scanning 
mutagenesis of each lead peptide, so each amino 
acids position may be either the original amino acid 
or alanine (alanine being a semi -conservative 

35 substitution for all other amino acids) , and/or 

(c) simultaneous random mutagenesis sampling 
conservative substitutions of some or all positions 
of each lead peptide, the number of sequences in 
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total sequences space for a given experiment being 
such that any sequence, if active, is within 
detection limits (typically, this means not more 
than about 10^° different sequences) . 
5 Substitutions are preferably at sites shown to tolerate 

mutation by the mutagenic strategies set forth above. 

The mutants are tested for activity, and, if active, are 
considered to be within "peptides of the present invention" . 
Even inactive mutants contribute to our knowledge of structure - 
10 activity relationships and thus assist in the design of 
peptides, peptoids, and peptidomimetics . 

The core sequences of the peptides may be identified by 
systematic truncation, starting at the N-terminal, the C- 
terminal, or both simultaneously or sequentially. The 
15 truncation may be one amino acid at a time, but preferably, to 
speed up the process, is of 10-50% of the molecule at one time. 
If a given truncation is unsuccessful, one retreats to a less 
dramatic truncation intermediate between the last successful 
truncation and the last unsuccessful truncation. 
20 Most extensions should be tolerated. However, if one is 

not, it may be helpful to introduce a linker, such as one made 
primarily of amino acids such as Glycine (introduces 
flexibility) , and Proline (introduce a rigid extension) , or 
other amino acids favored in protein turns, loops and 
25 interdomain boundaries. Indeed, the sequences of such segments 
may be used directly as linkers. 

Preferably, substitutions of exotic amino acids for the 
original amino acids take the form of 

(I) replacement of one or more Hydrophilic amino 
3 0 acid side chains with another hydrophilic 

organic radical, not more than twice the volume 
of the original side chain, or 

(II) replacement of one or more hydrophobic amino 
acid side chains with another hydrophobic 

3 5 organic radical, not more than twice the volume 

of the original side chain. 
The exotic amino acids may be alpha or non-alpha amino 
acids (e.g., beta alanine) . They may be alpha amino acids with 
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2 R groups on the Cof, which groups may be the same or 
different. They may be dehydro amino acids (HOOC-C {NH2) =CHR) . 

For further information on synthesis of peptides including 
exotic amino acids, see: 
5 1. Bielfeldt, T., Peters, S., Meldal, M., Bock, K. and 

Paulsen, N.A. new strategy for solid-phase synthesis of O- 
glycopeptides . Angew. Chem. (Engl) 31:857-859, 1992. 

2. Gurjar, M.K. and Saha, U.K. Synthesis of the 
glycopeptide-O- ( 3 , 4 - di -O -methyl - 2 - 0 - [3 , 4 -di -O-methyl -of-L- 

10 rhamnopyranosyl] -Qf-L-rhamnophyranosyl) -L-alanilol : An unusual 
part structure in the glycopeptidolipid of Mycobacterium 
f ortuitum . Tetrahedron 48:4039-4044, 1992. 

3. Kessler, H., Wittmann, V., Kock, M. and Kottenhahn, 
M. Synthesis of C-glycopept ides via free radical addition of 

15 glycosyl bromides to dehydroalanine derivatives. Angew. Chem. 
(Engl.) 31:902-904, 1992. 

4. Kraus, J.L. and Attardo, G. Synthesis and biological 
activities of new f ormylated methionyl peptides containing 
an Qf-substituted glycine residue. European Journal of 

20 Medicinal Chemistry 27 : 19-26 , 1992. 

5. Mhaskar, S.Y. Synthesis of N-lauroyl dipeptides and 
correlation of their structure with surfactant and 
antibacterial properties. J. Am. Oil Chem. S^oc. 69 : 647-652 , 
1992 . 

25 6. Moree, W.J., Van der Marel, G.A. and Liskamp, R.M.J. 

Synthesis of peptides containing the j8-substituted aminoethane 
sulfinamide or sulfonamide transition-state isostere derived 
from amino acids. Tetrahedron Lett. 33:69-6392, 1992. 

7. Paquet, A. Further studies on the use of 2,2,2- 
3 0 trichloroethyl groups for phosphate protection in phosphoserine 

peptide synthesis. International Journal of Peptide and 
Protein Research 39:82-86, 1992. 

8. Sewald, N., Riede, J., Bissinger, P. and Burger, K. 
A new convenient synthesis of 2 - trif luoromethyl substituted 

35 aspartic acid and its isopeptides. Part 11. Journal of the 
Chemical Society. Perkin Transactions 1 1992:267-274, 1992. 

9. Simon, R.J., Kania, R.S., Zuckermann, R.N., Huebner, 
V.D., Jewell, D.A. , Banville, S., Ng, S., Wang, L. . , Rosenberg, 
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S., Marlowe, C.K., Spellmeyer, D.C., Tan, R. , Frankel, A.D., 
Santi, D.V., Cohen, F.E, andBartlett, P. A. Peptoids : A modular 
approach to drug discovery. Proc. Na.tl, Acad, Sci . USA 

89:9367-9371, 1992 . 
5 10. Tung, C.-H., Zhu, T., Lackland, H. and Stein, S. An 

acridine amino acid derivative for use in Fmoc peptide 
synthesis. Peptide Research 5:115-118, 1992. 

11. Elofsson, M. Building blocks for glycopeptide 
synthesis: Glycosylat ion of 3 -mercaptopropionic acid and Fmoc 

10 amino acids with unprotected carboxyl groups. Tetrahedron 
Lett. 32:7613-7616, 1991. 
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Cyclic Peptides 

Many naturally occurring peptide are cyclic. Cyclization 
5 is a common mechanism for stabilization of peptide conformation 
thereby achieving improved association of the peptide with its 
ligand and hence improved biological activity. Cyclization is 
usually achieved by intra-chain cystine formation^ by formation 
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10 terminals. Cyclization was usually achieved by peptides in 
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describe cyclization of peptides on beads (see references 
below) . 
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Peptoid 

5 A peptoid is an analogue of a peptide in which one or more 

of the peptide bonds are replaced by pseudopeptide bonds, which 
may be the same or different. 

Such pseudopeptide bonds may be: 
Carba ^(CH2-CH2) 
10 Depsi ^(CO-O) 

Hydroxyethylene ^(CHOH-CH2) 
Ketomethylene St^(CO-CH2) 
Me t hy 1 ene - ocy CH2 - O - 
Reduced CH2-NH 
15 Thiomethylene CH2-S- 

Thiopeptide CS-NH 
N-modified -NRCO- 
See also 
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Solid phase synthesis of peptides containing the non- 
gydrolysable analog of (O) phosphotyrosine , p (CH2PO3H2) Phe . 
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Peptidomimetic 

A peptidomimetic is a molecule which mimics the biological 
activity of a peptide, by substantially duplicating the 
pharmacologically relevant portion of the conformation of the 
5 peptide, but is not a peptide or peptoid as defined above. 
Preferably the peptidomimetic has a molecular weight of less 
than 700 daltons. 

Designing a peptidomimetic usually proceeds by: 

(a) identifying the pharmacophoric groups responsible 
10 for the activity; 

(b) determining the spatial arrangements of the 
pharmacophoric groups in the active conformation of 
the peptide; and 

(c) selecting a pharmaceut ically acceptable template 
15 upon which to mount the pharmacophoric groups in a 

manner which allows them to retain their spatial 
arrangment in the active conformation of ' the 
peptide. 

Step (a) may be carried out by preparing mutants of the 

20 active peptide and determining the effect of the mutation on 
activity. One may also examine the 3D structure of a complex 
of the peptide and the receptor for evidence of interactions, 
e.g., the fit of a side chain of the peptide into a cleft of 
the receptor; potential sites for hydrogen bonding, etc.) . 

25 Step (b) generally involves determining the 3D structure 

of the active peptide, in the complex, by NMR spectroscopy or 
X-ray diffraction studies. The initial 3D model may be refined 
by an energy minimization and molecular dynamics simulation. 

Step (c) may be carried out by reference to a template 

30 database, see Wilson, et al . Tetrahedron, 49:3655-63 (1993). 
The templates will typically allow the mounting of 2-8 
pharmacophores, and have a relatively rigid structure. For the 
latter reason, aromatic structures, such as benzene, biphenyl, 
phenanthrene and benzodiazepine, are preferred. For orthogonal 

35 protection techniques, see Tuchscherer, et al., Tetrahedron, 
17:3559-75 (1993) . 

For more information on peptoids and peptidomimetics , see 
USP 5,811,392, USP 5,811,512, USP 5,578,629, USP 5,817,879, USP 
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5,817,757, USP. 5,811,515. 
Analogues 

Also of interest are analogues of the disclosed peptides, 
and other compounds with activity of interest. 
5 Analogues may be identified by assigning a hashed bitmap 

structural fingerprint to the compound, based on its chemical 
structure, and determining the similarity of that fingerprint 
to that of each compound in a broad chemical database. The 
fingerprints are determined by the fingerprinting software 

10 commercially distributed for that purpose by Daylight Chemical 
Information Systems, Inc., according to the software release 
current as of January 8, 1999. In essence, this algorithm 
generates a bit pattern for each atom, and for its nearest 
neighbors, with paths up to 7 bonds long. Each pattern serves 

15 as a seed to a pseudorandom number generator, the output of 
which is a set of bits which is logically ored to the 
developing fingerprint. The fingerprint may be fixed or 
variable size. 

The database may be SPRESI'95 (InfoChem GmbH), Index 
20 , Chemicus (ISI), MedChem (Pomona/Biobyte) , World Drug Index 
(Derwent) , TSCA93 (EPA) May bridge organic chemical catalog 
(Maybridge) , Available Chemicals Directory (MDLIS Inc.), NCI96 
(NCI), Asinex catalog of organic compounds (Asinex Ltd.)-, or 
IBIOScreen SC and NP (Inter BioScreen Ltd.), or an inhouse 
25 database. 

A compound is an analogue of a reference compound if it 
has a daylight fingerprint with a similarity (Tanamoto 
coefficient) of at least 0.85 to the Daylight fingerprint of 
the reference compound. 

3 0 A compound is also an analogue of a reference compound id 

it may be conceptually derived from the reference compound by 
isosteric replacements . 

Homologues are compounds which differ by an increase or 
decrease in the number of methylene groups in an alkyl moiety. 

35 Classical isosteres are those which meet Erlenmeyer's 

definition: "atoms, ions or molecules in which the peripheral 
layers of electrons can be considered to be identical" . 
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Classical isos teres include 

Monovalents Bivalents Trivalents Tetra Annular 

F, OH, NH2, CH3 -O- -N= =C= -CH = CH- 

-Si = 

5 CI, SH, PH2 -S- -P= -N+= -S- 

Br -Se- -As- =P+= -O- 

i -Te- -Sb- =As+= -NH- 

-CH= =Sb+= 
Nonclassical isosteric pairs include -CO- and -SO2-, -COOH 

10 and -SO3H, -SO2NH2 and -PO(OH)NH2, and -H and -F, -0C(=0)- and 
C(=0)0-, -OH and -NH2. 

C3 Pharmaceutical Methods and Preparations 

''S The preferred animal subject of the present invention is 

fU a mammal. By the term "mammal" is meant an individual 

15 belonging to the class Mammalia. The invention is particularly 
lij useful in the treatment of human subjects, although it is 

intended for veterinary uses as well. Preferred nonhuman 
subjects are of the orders Primata (e.g., apes and monkeys), 
j-f Artiodactyla or Perissodactyla (e.g., cows, pigs, sheep, 

20 horses, goats), Carnivora (e.g., cats, dogs), Rodenta (e.g., 
^.3 rats, mice , guinea pigs , hamsters), Lagomorpha (e.g., rabbits) 

or other pet, farm or laboratory mammals. 

The term "protection", as used herein, is intended to 
include "prevention," "suppression" and "treatment." 
25 "Prevention" involves administration of the protein prior to 
the induction of the disease (or other adverse clinical 
condition) . "Suppression" involves administration of the 
composition prior to the clinical apioearance of the disease. 
"Treatment" involves administration of the protective 
3 0 composition after the appearance of the disease. Protection, 
including prevention, need not be absolute. 

It will be understood that in human and veterinary 
medicine, it is not always possible to distinguish between 
"preventing" and "suppressing" since the ultimate inductive 
35 event or events may be unknown, latent, or the patient is not 
ascertained until well after the occurrence of the event or 
events. Therefore, it is common to use the term "prophylaxis" 
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as distinct from "treatment" to encompass both "preventing" and 
"suppressing" as defined herein. The term "protection," as 
used herein, is meant to include "prophylaxis." It should also 
be understood that to be useful, the protection provided need 
5 not be absolute, provided that it is sufficient to carry 
clinical value. An agent which provides protection to a lesser 
degree than do competitive agents may still be of value if the 
other agents are ineffective for a particular individual, if 
it can be used in combination with other agents to enhance the 
10 level of protection, or if it is safer than competitive agents. 
The drug may provide a curative effect, an ameliorative effect, 
or both. 

At least one of the drugs of the present invention may be 
administered, by any means that achieve their intended purpose, 

15 to protect a subject against a disease or other adverse 
condition. The form of administration may be systemic or 
topical. For example, administration of such a composition may 
be by various parenteral routes such as subcutaneous, 
intravenous , intradermal , intramuscular , intraperitoneal , 

20 intranasal, transdermal, or buccal routes. Alternatively, or 
concurrently, administration may be by the oral route. 
Parenteral administration can be by bolus injection or by 
gradual perfusion over time. 

A typical regimen comprises administration of an effective 

25 amount of the drug, administered over a period ranging from a 
single dose, to dosing over a period of hours, days, weeks, 
months, or years. 

It is understood that the suitable dosage of a drug of the 
present invention will be dependent upon the age, sex, health, 

30 and weight of the recipient, kind of concurrent treatment, if 
any, frequency of treatment, and the nature of the effect 
desired. However, the most preferred dosage can be tailored 
to the individual subject, as is understood and determinable 
by one of skill in the art, without undue experimentation. 

35 This will typically involve adjustment of a standard dose, 
e.g., reduction of the dose if the patient has a low body 
weight . 

Prior to use in humans, a drug will first be evaluated for 
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safety .and efficacy in laboratory animals. In human clinical 
studies, one would begin with a dose expected to be safe in 
humans, based on the preclinical data for the drug in question, 
and on customary doses for analogous drugs (if any) . If this 
5 dose is effective, the dosage may be decreased, to determine 
the minimum effective dose, if desired. If this dose is 
ineffective, it will be cautiously increased, with the patients 
monitored for signs of side effects. See, e.g., Berkow et al , 
eds . , The Merck Manual, 15th edition, Merck and Co., Rahway, 

10 N.J., 1987; Goodman et al . , eds., Goodina.n a.nd Gllman^s The 
Pharmacol og-ical Basis of Therapeutics , 8th edition, Pergamon 
Press, Inc., Elmsford, N.Y., (1990); Avery^s Drug Treatments- 
Principles and Practice of Clinical Pharmacology and 
Therapeutics, 3rd edition, ADIS Press, LTD., Williams and 

15 Wilkins, Baltimore, MD. (1987) , Ebadi, Pharmacology, Little, 
Brown and Co., Boston, (1985), which references and references 
cited therein, are entirely incorporated herein by reference.. 

The total dose required for each treatment may be 
administered by multiple doses or in a single dose. The. 

20 protein may be administered alone or in conjunction with other 
therapeutics directed to the disease or directed to other 
symptoms thereof . 

The appropriate dosage form will depend on the disease, 
the protein, and' the mode of administration; possibilities 

25 include tablets, capsules, lozenges, dental pastes, 
suppositories, inhalants, solutions, ointments and parenteral 
depots. See, e.g., Berker, supra, Goodman, supra, Avery, supra 
and Ebadi, supra, which are entirely incorporated herein by 
reference, including all references cited therein. 

3 0 In the case of peptide drugs, the drug may be adminstered 

in the form of an expression vector comprising a nucleic acid 
encoding the peptide, such a vector, after in corporation into 
the genetic complement of a cell of the patient, directs 
synthesis of the peptide. Suitable vectors include genetically 

35 engineered poxviruses (vaccinia) , adenoviruses, adeno- 
associated viruses, herpesviruses and lentiviruses which are 
or have been rendered nonpathogenic . 

In addition to at least one drug as described herein, a 
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pharmaceutical composition may contain suitable 
pharmaceutically acceptable carriers, such as excipients, 
carriers and/or auxiliaries which facilitate processing of the 
active compounds into preparations which can be used 
5 pharmaceutically. See, e.g., Berker, supra, Goodman, supra., 
Avery, supra, and Ebadi, supra, which are entirely incorporated 
herein by reference, included all references cited therein. 

Anti-Cancer Utility 

One utility of certain ER-binding peptides of the present 

10 invention, and related peptoids , peptidomimetics and analogues , 
and compounds fingerprinted as sensitive to the interaction of 
such peptides with ER, is in circumventing tamoxifen resistance 
in breast cancer. 

It is now estimated that the lifetime risk among American 

15 women of being diagnosed with breast cancer is about one in 
eight. Although this figure represents a doubling of the 
incidence of this disease over the past fifty years, it .is 
counterbalanced by the observation that mortality from this 
disease has decreased slightly over the same period. In the 

20 recent NSAPB-B14 trial it was demonstrated that the 10 year 
survival rate in breast cancer patients who were node negative 
at time of diagnosis was greater than 80%. It is likely that 
this favorable response is due in large part to advances in 
early detection which has had the effect of decreasing the 

25 number of women who present with metastatic disease to more 
manageable early stage malignancies. In addition to early 
detection however, the strategic use of the antiestrogen 
tamoxifen for the treatment of metastatic disease and as an 
adjuvant chemotherapeutic has had a positive impact on survival 

30 in breast cancer patients. One of the most dramatic benefits 
of tamoxifen is that it reduces the incidence of contralateral 
primary tumors in patients by greater than 50%. It was this 
finding, combined with the results of the NSABP-Pl 
chemoprevention trial, which led recently to the approval of 

3 5 tamoxifen for use as a breast cancer chemopreventat ive in women 
who are at an elevated risk for breast cancer. Clearly, 
tamoxifen is an extremely successful pharmaceutical. As with 
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most drugs however, the effectiveness of tamoxifen as a 
chemotherapeutic agent decreases with time. In the metastatic 
setting, it has been observed that most tamoxifen responsive 
breast cancers eventually become resistant to its 
5 antiestrogenic actions. A decrease in effectiveness over time 
in the adjuvant setting is also inferred from the results of 
the NSABP-B14 trial which demonstrated that the overall 
survival rate of breast cancer patients who were asking 
tamoxifen for 10 years was no better, and possibly even worse, 

10 than women who took this drug for only five years. The latter 
result has led to the suggestion that the tumors in patients 
who were on tamoxifen for extended periods of time may loose 
the ability to recognize this drug as an antiestrogen and may 
in fact change in some manner to respond to the drug as an 

15 estrogen. The observation that some patients display a 
withdrawal response when tamoxifen administration is 
discontinued supports this hypothesis. Consequently, there has 
been a tremendous amount of interest in understanding the 
process by which breast tumors fail tamoxifen and in the 

2 0 application of this knowledge to the development of novel 

ant iestrogens with improved therapeutic benefits. 

Several years ago it was considered unlikely that the 
estrogen receptor (ER) would be a useful target in those cells 
which have failed tamoxifen. -However, the emergence of pure 
25 antiestrogens, like ICI182,780, which have been used 
successfully to treat tamoxifen refractory breast cancers has 
validated ER as a target in this stage of the disease. 
However, since they non- select ively block estrogen action in 
all target organs they will have a negative impact in the 

3 0 skeletal and cardiovascular systems and consequently will not 

be suitable for use as adjuvant chemotherapeutics . There is 
an unmet medical need therefore, for novel antiestrogens which 
are mechanistically distinct from tamoxifen in the breast but 
which retain the positive estrogenic actions of tamoxifen in 
3 5 the bond and the cardiovascular systems. 

Tamoxifen was developed originally as an antiestrogen 
which could be used to block the actions of estrogen at the 
receptor level in breast cancer, cells . Thus, it was generally 
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held that resistance to this agent occurred as a consequence 
of ER mutations, selective extrusion of the compound from cells 
or as a result of inactivating metabolic processes. However, 
it now appears that these mechanisms only explain tamoxifen 
5 resistance in a small percentage of cases. Other mechanisms 
are now being considered. We favor a model in which epigenetic 
changes occur within target cells affecting their ability to 
recognize tamoxifen as an antagonist and may in fact permit 
them to recognize the drug as an estrogenic ligand. This 
. 10 hypothesis stems from the observation that tamoxifen is in fact 
a selective estrogen receptor modulator (SERM) which can 
function as an ER antagonist, or an agonist, depending on the 
cell background in which it is studied. Thus, we believe that 
in breast the selective pressure of tamoxifen promotes the 

15 outgrowth of a population of cells, through accommodation or 
selection, which recognize tamoxifen as an agonist. 
Consequently, we and others, have focused on defining the 
molecular basis for the cell selective actions of tamoxifen, 
and other SERMs, with a view to understanding tamoxifen 

20 resistance and the eventual development of novel antiestrogens . 
These studies have revealed that upon binding ligand, ER 
undergoes a conformational change, the nature of which is 
influenced by the structure of the bound ligand. The 
significance of these conformational changes was revealed when 

25 it was determined that ER contains two activation domains, AF-1 
located at the amino terminus and AF-2 contained within the 
hormone binding domain, the activity of which is influenced by 
both cell and promoter context. In most cells both AFs are 
required for maximal transcriptional activity. Accordingly, 

3 0 it has been shown that estradiol functions as an ER agonist in 
all cells as it facilitates the interaction of both AFs with 
the transcription apparatus. It has now been determined that 
tamoxifen alters ER structure in a manner which inhibits AF-2 
function. Thus, in all contexts where AF-2 is required, 

35 tamoxifen manifests antagonist activity. In cell contexts 
where AF-1 alone is sufficient for ER transcriptional activity 
we have determined that tamoxifen can function as a partial 
agonist. This finding led us to hypothesize that the residual 
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agonist activity of tamoxifen, observed in AF-1 dominant 
environments, may be linked to the failure of this drug as an 
antiestrogen in breast cancer. Thus, we searched for compounds 
which did not activate AF-1 and evaluated their ability to 
5 inhibit tamoxifen partial agonist activity. This work led to 
the discovery of a novel antiestrogen, GW5638, which when 
assayed in vitro, inhibits tamoxifen partial agonist activity 
under all conditions examined and effectively inhibited the 
growth of MCF-7 cell xenografts in A-thymic nude mice. Because 
10 of these properties, GW5638 will soon enter clinical trails for 
evaluation as a treatment of tamoxifen refractory breast 
cancer . 

One of the surprising properties of the novel 
antiestrogen, GW5638, is that although it is devoid of AF-1 and 

15 AF-2 agonist activity it is not a pure antagonist when assayed 
in vivo. Unlike tamoxifen, it does not display uterotrophic 
activity. However, like tamoxifen, it functions as an estrogen 
in bone and the cardiovascular system. These results indicate 
that the ability to differentially activate AF-1 and AF--2 may 

2 0 be important but that the pharmacology of this class of 
antiestrogens is more complex than we anticipated. 
Consequently, we have focused recently on defining the 
molecular mechanism (s) by which cells distinguish between 
tamoxifen and GW5638. Although still ongoing, it has led co 

2 5 the development of a novel approach to inhibit the partial 

agonist activity of tamoxifen. Specifically, using phage 
display technology we have identified small peptides whose 
interaction with ER is influenced by the nature of the bound 
ligand. Peptides have been found which interact with ER in the 

3 0 presence of any ligand, in the presence of any agonist, in the 

presence of any antagonist and more importantly, we have 
identified peptides which interact with ER only in the presence 
of tamoxifen. With respect to the development of strategies 
to treat tamoxifen refractory breast cancer, the latter 
3 5 peptides are the most interesting as we have shown in vitro 
that these peptides efficiently inhibit tamoxifen partial 
agonist activity. Mapping of the sites on ER with which these 
peptides interact will help in determining if they mimic 
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specific coactivator interactions. Regardless however, this 
work has defined several sites on ER that will serve as targets 
for new drug discovery. Although peptides do not generally 
serve as good starting places for drug development, there has 
5 been a tremendous amount of progress of late in generating 
small molecules which modulate protein-protein interactions. 
Consequently, we are now in the process of screening for small 
molecules which interact with the target sites implicated by 
the novel peptides and additionally are in the process of 
10 defining smaller peptides which in themselves may be useful, 
if suitably formulated, as drugs. 

tj In Vivo Diagnostic Uses 

'^f- Analyte-binding molecules can be used for in vivo imaging. 

fu Radio- labelled binding molecule may be administered to the 

15 human or animal subject. Administration is typically by 
injection, e.g., intravenous or arterial or other means of 
'^^ administration in a quantity sufficient to permit subsequent 

1^ dynamic and/or static imaging using suitable radio-detecting 

13 devices . The preferred dosage is the smallest amount capable 

I? 20 of providing a diagnostically effective image, and may be 

determined by means conventional in the art, using known radio- 
imaging agents as a guide. 

Typically, the imaging is carried out on the whole body 
of the subject, or on that portion of the body or organ 
25 relevant to the condition or disease under study. The radio- 
labelled binding molecule has accumulated. The amount of 
radio- labelled binding molecule accumulated at a given point 
in time in relevant target organs can then be quantified. 

A particularly suitable radio-detecting device is a 
30 scintillation camera, such as a gamma camera. A scintillation 
camera is a stationary device that can be used to image 
distribution of radio- labelled binding molecule. The detection 
device in the camera senses the radioactive decay, the 
distribution of which can be recorded. Data produced by the 
35 imaging system can be digitized. The digitized information can 
be analyzed over time discontinuously or continuously. The 
digitized data can be processed to produce images, called 
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frames, of the pattern of uptake of the radio- labelled binding 
protein in the target organ at a discrete point in time. In 
most continuous (dynamic) studies , quantitative data is 
obtained by observing changes in distributions of radioactive 
5 decay in target organs over time. In other words, a time- 
activity analysis of the data will illustrate uptake through 
clearance of the radio- labelled binding molecule by the target 
organs with time. 

Various factors should be taken into consideration in 

10 selecting an appropriate radioisotope. The radioisotope must 
be selected with a view to obtaining good quality resolution 
upon imaging, should be safe for diagnostic use in humans and 
animals, and should preferably have a short physical half -life 
so as to decrease the amount of radiation received by the body. 

15 The radioisotope used should preferably be pharmacologically 
inert, and, in the quantities administered, should not have any 
substantial physiological effect. 

The binding molecule may be radio- labelled with different 
isotopes of iodine, for example ^^^I, ^^^I, or ^^^I (see for 

20 example, U.S. Patent 4,609,725) . The extent of radio-labeling 
must, however be monitored, since it will affect the 
calculations made based on the imaging results (i.e. a 
diiodinated binding molecule will result in twice the radiation 
count of a similar monoiodinated binding molecule over the same 

2 5 time frame) . 

In applications to human subjects, it may be desirable to 
use radioisotopes other than ^^^I for labelling in order to 
decrease the total dosimetry exposure of the human body and to 
optimize the detectability of the labelled molecule (though 

30 this radioisotope can be used if circumstances require) . Ready 
availability for clinical use is also a factor . Accordingly, 
for human applications, preferred radio- labels are for example, 
99-Tc, ^^Ga, ^^Ga, ^°Y, ^^^In, ^^^"^In, ^^^I, ^^^Re, ^«^Re or ^^^At . 

The radio- labelled binding molecule may be prepared by 

35 various methods. These include radio-halogenation by the 
chloramine - T method or the lactoperoxidase method and 
subsequent purification by HPLC (high pressure liquid 
chromatography), for example as described by J. Gutkowska et 
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al in "Endocrinology and Metabolism Clinics of America: (1987) 
16 (1) :183. Other known method of radio-labelling can be used, 
such as lODOBEADS™. 

There are a number of different methods of delivering the 
5 radio-labelled binding molecule to the end-user. It may be 
administered by any means that enables the active agent to 
reach the agent's site of action in the body of a mammal. If 
the molecule is digestible when administered orally, parenteral 
administration, e.g., intravenous, subcutaneous, or 
10 intramuscular, would ordinarily be used to optimize absorption. 

Other Uses 

The binding molecules of the present invention may also 
be used to purify target from a fluid, e.g., blood. For this 
purpose, the target -binding molecule is preferably immobilized 
15 on a solid-phase support. Such supports include those already 
mentioned as useful in preparing solid phase diagnostic 
reagents . 

Peptides, in general, can be used as molecular weight 
markers for reference in the separation or purification of 

20 peptides by electrophoresis or chromatography. In many 
instances, peptides may need to be denatured to serve as 
molecular weight markers. A second general utility for 
peptides is the use of hydrolyzed peptides as a nutrient 
source. Hydrolyzed peptide are commonly used as a growth media 

25 component for culturing microorganisms, as well as a food 
ingredient for human consumption. Enzymatic or acid hydrolysis 
is normally carried out either to completion, resulting in free 
amino acids, or partially, to generate both peptides and amino 
acids. However, unlike acid hydrolysis, enzymatic hydrolysis 

30 (proteolysis) does not remove non-amino acid functional groups 
that may be present. Peptides may also be used to increase the 
viscosity of a solution. 

The peptides of the present invention may be used for any 
of the foregoing purposes , as well as for therapeutic and 

35 diagnostic purposes as discussed further earlier in this 
specification . 
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EXAMPLES 
Exampl e 1 

Initial Studies Relating to the Estrogen Receptor 

The estrogen receptor (ER) is a member of the steroid 
5 family of nuclear receptors. Like other nuclear receptors, the 
ER is a ligand dependent transcriptional activator. R. C. J. 
Ribeiro, P. J. Kushner, J. D. Baxter, Ann. Rev. Med. 46, 443, 
(1995); J.-M. Wurtz et al . , Nat. Struct. Biol. 3, 87 (1996); 
D. Moras and H. Gronemeyer, Curr. Opin. Cell Biol. 10, 3 84 

10 (1998) . Two distinct estrogen receptors have been described, 
ER q: and ER jS, which may play distinct roles in gene 
regulation. K. Paech et al . , Science 277, 1508 (1997); G. G. 
J. M. Kuiper and J. -A Gustafsson, FEES Lett. 410, 87 (1997); 
J. T. Moore et al . , Biochem. Biophys . Res. Cowm. 247, 75 

15 (1998); V. Giguere, A. Tremblay, G. B. Tremblay, Steroids 63, 
335 (1998). In addition to the natural ligand, estradiol, the 
activity of the estrogen receptor is regulated by the 
association/dissociation of accessory proteins collectively 
termed co-activators and co-repressors . J. Torchia et al ; , 

20 Nature 387, 677 (1997); C. K. Glass, D. W. Rose, M. G. 
Rosenfeld, Curr. Opin. Cell Biol. 9, 222 (1997); J. Torchia, 
C. Glass, M. G. Rosenfeld, ibid. 10, 373 (1998) . Upon binding 
estradiol, the ER undergoes a conformational change that 
exposes sites for the association of co-activating proteins. 

25 This change may also conceal the binding sites for co- 
repressors or other molecules that are associated with the 
inactive receptor, thus preventing their association. 

The estrogen receptor is a therapeutic target for diseases 
such as breast and ovarian cancer, and it is also the target 

3 0 for drugs that ameliorate symptoms and effects of menopause 
including osteoporosis. While effective, compounds that target 
the estrogen receptor can exhibit a variety of effects in 
different target tissues. For example, tamoxifen is an estrogen 
receptor antagonist in breast tissue and is effective in 

35 slowing the growth of ER positive breast tumors. However, 
tamoxifen can have agonist effects on uterine cell growth. M. 
A. Gallo and D. Kaufman, Seminars in Oncology 24 (suppl.l) , Sl- 
71 (1997). Because of their wide range of effects, estrogen 
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receptor targeted drugs cannot be classified as strict agonists 
or antagonists, but are more appropriately called s.elective 
estrogen receptor modulators or SERMS . H. U. Bryant and W. H. 
Dere, Proc . Soc . Exp. Biol. Med. 217, 45 (1998). SERMs appear 
5 to drive the receptor into conformations that are neither fully 
active nor inactive. Distinguishing between these various 
intermediate conformations in an in vitro environment has been 
a difficult task at best. We have developed peptidic probes 
that allow distinction between ER conformations induced by 

10 different SERMs. Each SERM, which has a distinct biological 
effect, also produces a unique pattern in the fingerprint 
assay. These probes should provide valuable tools for both 
research and drug discovery, and may provide a link between 
receptor conformation and biological activity. 

15 In this example, peptides were identified which bind to 

the unliganded estrogen receptor o: (ex. 1.1; table 1) or to the 
estradiol-activated receptor (ex 1.2, table 2) . These Era- 
binding peptides were then classified (ex 1.3) into five 
arbitrary classes on the basis of their ability to bind to Era; 

20 or Er^ in the presence or absence of estradiol. (Naturally, 
they all bound either the apo-ERa or the estradiol-activated 
Erof . ) Finally, representative peptides of each class were used 
to "fingerprint" the known ER SERMs estradiol, estriol, 
nafoxidine, tamoxifen or clomifene (ex. 1.4) . 

25 Example 1.1: Identification of peptides that bind to the 

unliganded (unactivated) estrogen receptor ot 

ER alpha (Panvera Corp.) was immobilized on Immulon 4 

plastic plates (Dynatech) for the phage affinity selection as 

described in patent application Fowlkes, 09/050,359. Peptide 
3 0 sequences obtained for binding to the unliganded (unactivated) 

receptor are listed below (Table 1) . 

Example 1.2: Identification of peptides that bind to the 
estradiol activated estrogen receptor a. 

Estrogen receptor was immobilized as described above and 
35 incubated with 100 /xM estradiol for 15 minutes prior to the 
addition of phage for affinity selection. Sequences obtained 
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in the presence of estradiol are listed in Table 2. 

In the presence of estradiol, numerous sequences were 
isolated which contain the consensus LXXLL . This motif, which 
is found in nuclear receptor co-activators, has previously been 
5 shown to be necessary and sufficient for their association with 
nuclear receptors. This association is accomplished via a 
helical region found in the ligand binding domain of the ER 
that is exposed upon binding of estradiol. Crystallographic 
studies indicated that this region is not properly positioned 

10 in the presence of some SERMS, thus preventing co-activator 
association at this site. See generally D. M. Heery, E. 
Kalkhoven, S. Hoare, M . G. Parker, Nature 387, 73 (1997); M. 
Nichols, J. M. J. Rientjes, A. F. Stewart, EMBO J. 17, 765 
(1998); W. Feng et al . , Science 280, 1747 (1998); A. M. 

15 Brzozowski et al . , Nature 389, 753 (1997). 

Consistent with this, peptide sequences containing the 
LXXLL motif were not isolated during affinity selection on the 
apo-receptor or in the presence of 4 -OH tamoxifen. 

Example 1.3: Classification of peptide sequences 
20 a.) Comparison of phage vis-a-vis binding to the ER a and 

(3 in the presence or absence of estradiol 

Phage expressing distinct peptide sequences were 

classified according to a number of different parameters. 

Initial studies measured the relative binding of each of the 
25 phage to ER a and (3 in the absence or presence of estradiol. 

ER Of and (3 were immobilized on Immulon 4 plates and treated for 

15 minutes with 100 fxM estradiol or buffer alone prior to the 

addition of phage supernatant from a fresh overnight culture. 

Bound phage were detected using an anti-M13 antibody coupled 
3 0 to HRP. From these results, 12 phage were selected for further 

study. Sequences were selected that bound preferentially to 

ER a or ER jS and that bound preferentially in the absence or 

presence of estradiol . 

b) Competition of phage with a peptide containing an LXXLL 
3 5 motif. 

The co-activating proteins that that have been identified 
to date, interact with nuclear receptors via a leucine rich 
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region on the coactivator with the consensus LXXLL, where L is 
leucine and X is any amino acid. Co-activators containing this 
consensus motif bind to the ER at helix 12 in the AF2 domain 
(the C-terminal transactivation domain) . This helical region 
5 is exposed when the receptor is activated. Many of the peiptide 
sequences that were isolated for the activated receptor were 
leucine rich and a great number contained the LXXLL motif. All 
of these sequences bound preferentially to the activated ER. 
A peptide containing an LXXLL motif was synthesized and used 
10 in competition assays with phage to determine if the binding 
of the LXXLL peptide to the ER would affect the binding of the 
phage. The peptide sequence corresponds to peptide #4 that was 
isolated in the presence of estradiol: SSNHQSSRLIELLSRSGSGK- 
biotin . 

15 ER ot and (3 were immobilized as described above and pre- 

incubated in the presence of 100 /xM LXXLL peptide, 100 /xM 
estradiol, buffer alone, or a combination of 100 /xM estradiol 
and 100 /xM peptide for 2 0 min prior to adding phage supernatant 
from a fresh overnight culture. Bound phage were detected as 

20 described above. All of the phage expressing an LXXLL 
containing peptide were competed by the peptide, and several 
other phage that do not contain the LXXLL motif were also 
competed by the peptide. These phage may express sequences that 
mimic the LXXLL motif, or they may be allosterically affected 

25 by the binding of the peptide. There were also phage that do 
not contain an LXXLL motif that did not compete with peptide. 

Based on these data, the peptide sequences were divided 
into 5 classes listed below, as seen in Table 3 and 4 . Table 
3 lists peptides of each class, while Table 4 defines the 

30 classes. In a comparison of binding to unliganded ER a and jS, 
class 1 and class 5 peptides have higher affinity for ER . 
Class 2, 3 and 4 peptides have higher affinity for ER Of. 
Ligand (estradiol) increases the affinity of class 1 peptides 
for both ER a and /3, and decreases the binding of class 5 

35 peptides to both receptors. Ligand has no effect on the 
binding of class 2 peptides to either receptor. Ligand 
increases the binding of class 3 peptides to ER of, while having 
no effect on ER (3, and ligand decreases the binding of class 
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4 peptides to ER a while also having no effect on ER /S. A 
peptide containing an LXXLL motif, described above, was able 
to compete with phage from class 1 on both ER a and /8, and with 
phage from classes 4 and 5 on ER of only. Phage from classes 
5 2 and 3 did not compete with the LXXLL peptide on either 
receptor . 

Example 1,4: Fingerprinting- estrogen receptor agonists 
and SERMs 

There are many known agonists and SERMs for the estrogen 
10 receptor. For initial testing of the fingerprinting system, 
two agonists, 11 - 13 estradiol and estriol, and three SERMs, 4 -OH 
tamoxifen, nafoxidine and clomiphene, were selected. All three 
SERMs are derivatives of triphenylethylene . All reagents were 
purchased from Sigma . 
15 The effect of agonists and SERMs on the binding of phage 

from each of the 5 classes described above was investigated. 
To do this, immobilized ER a or ER jS was incubated with 100 ^M 
estradiol, estriol, nafoxidine, tamoxifen or clomifene in TEST 
or with TEST alone for 2 0 minutes prior to adding the phage 
20 supernatant from a fresh overnight culture. Following a 1 hour 
incubation, the wells were washed five times with TEST and the 
bound phage were visualized using an anti-M13 antibody coupled 
to HRP. 

The following fingerprints were identified (Table 6) . The 
25 data are based on the. relative change in binding (as determined 
by an increase or decrease in absorbance) compared to the 
unliganded receptor. The number of + or - signs indicates the 
degree and the direction of the change in signal; +/- indicates 
no significant change. 

30 The agonists (estradiol and estriol) produce fingerprints 

that are distinct from those of the SERMs (tamoxifen, 
nafoxidine and clomiphene) . In addition, the fingerprints are 
different for ER a and ER (3 . As predicted, the agonists, which 
have similar biological effects, produce fingerprints that are 

3 5 similar on each receptor. The SERMs are all from the same 
class of triphenylethylene derivatives and have similar yet 
distinct biological effects. The fingerprinting analysis 
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readily distinguishes them from pure agonists and also 
indicates that they may have similar yet distinct in vivo 
activities . 

If an increase in the binding of a class 1 peptide 
indicates agonist activity, then the fingerprint suggests that 
tamoxifen produces low levels of agonist activity on ER of and 
no agonist activity on ER (3. Similarly, if the reduction in 
the binding of a class 4 peptide indicates agonist activity, 
then the fingerprint suggests that tamoxifen has antagonist 
activity on both ER a and (3. The combination of the signals 
with each peptide class creates a fingerprint for the SERM that 
provides information on the relative levels of agonist and 
antagonist activity it produces. The differential changes in 
the signals on ER of and ER 13 may indicate the tissue 
specificity of the alteration in receptor activity in response 
to the SERM. 
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Example 2 Further Investiga.tions with Estrogen Receptors 

Affinity selection of phage displayed peptide libraries 
(Sparks, et al . (1996) , Phage Display of Peptides and Proteins, 
A Laboratory Manual, pp. 22 7-2 53) was conducted on both ERof and 
5 jS under conditions that were predicted to place the ER in 
different conformations: apo-ER, estradiol bound ER and 4-OH 
tamoxifen bound ER. Unique sets of high affinity peptides were 
identified under each condition. Most notably, affinity 
selection of peptides in the presence of estradiol revealed a 

10 number of sequences containing an LXXLL motif (Table lOOA) . 
This motif, which is found in nuclear receptor co-activators 
(Table lOOB) , has been shown to be necessary and sufficient for 
their association with nuclear receptors (Heery, et al . (1997) , 
Nature, 3 87:733-736) . Studies have .shown that the association 

15 of the LXXLL motif with the ER is accomplished via a helical 
region in the ligand binding domain of the receptor that is 
exposed upon binding estradiol. Structural studies using X-ray 
crystallography have shown that this region is not properly 
positioned in the presence of raloxifene (Brzozowski, et al . 

20 (1997)) or 4-OH tamoxifen (Shiau, et al . (1998)), thus 
preventing the interaction of the co-activator LXXLL motif. 
The identification of these sequences in the presence of 
estradiol indicate that the ER is undergoing conformational 
changes in response to ligand in vitro consistent with the 

25 changes that are predicted to occur in vivo. 

Materials 

Estrogen receptor or and 13 were purchased from PanVera 
Corporation, Madison, WI . Immulon 4 96-well plates were from 
Dynatech. Streptavidin, 17-^6 estradiol, 4-OH tamoxifen, 

30 nafoxidine, clomiphene, diethylstilbestrol , progesterone, 16-of 
OH estrone, and estriol were purchased from Sigma. Premarin 
is a product of Wyeth-Ayerst . Raloxifene is a product of Eli 
Lilly Corporation. ICI 182,780 was purchased from Tocris 
Cookson Inc., Ballwin, MO. Anti-M13 antisera was purchased 

35 from Pharmacia. Sequencing of single strand M13 DNA was 
conducted by Sequetech Corp., Mountain View, CA. Peptide 
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synthesis was conducted by AnaSpec, San Jose, CA. 
Exa.mpl e 2.1: 

Additional phage affinity selections were made of peptides 
which bound plastic-immobilized Erof in the presence of the 
5 SERMs 4 -OH Tamoxifen, ICI 182,780, or both simultaneously (see 
Table 7) . 

Exampl e 2,2: 

Further phage affinity selections were made with ERof or 
ErjS conjugated to ERE (estrogen response element) , which in 
10 turn was immobilized. For Erof, selections were carried out with 
no ligand present (apo-receptor ) , or in- the presence of 17-/3 
estradiol, 4-OH Tamoxifen, Raloxifen, or ICI 182,780. 

The methodology is described in more detail below. 
Affinity selection of phage for the various conformations of 

15 the estrogen receptor was conducted essentially as described 
(Sparks, et al . (1996)). Selections were conducted with the 
estrogen receptor in TEST (lOnM Tris-HCl, pH 8.0, 150 nM NaCl, 
0.05% Tween 20), or in TEST containing 1 fxM 17-/3 estradiol, or 
4-OH tamoxifen. Immulon 4 96 -well plates were coated with 

20 streptavidin in 0.1 sodium bicarbonate. The plates were then 
incubated for 1 h with 2 pmol biotinylated, vitellogenin 
estrogen response element (ERE) per well (Anderson (1998) , 
Biochemistry, 37:17287-17298), followed by incubation for 1 h 
with 3 pmol (monomer) ERa or ERjS per well . Oligonucleotides 

25 corresponding to the vitellogenin ERE, biotin- 
GATCTAGGTCACAGTGACCTGCG (forward) and biotin- 

GATCCGCAGGTCACTGTGACCTA (reverse) , were synthesized by Genosys . 
The sequenced active peptides are shown in Table 8. For ERjS, 
selections were carried out with no ligand present, or in the 

30 presence of estradiol or tamoxifen. The resulting active 
peptides are shown in Table 9. 

Exampl e 2.3 

All of the phage were classified based on their ability 
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to bind to ER a and ER ^8, in the presence or absence of SERMs . 
These assays were conducted by phage ELISA. In essence, plastic 
plates were coated with streptavidin (sigma) . Biot inylated- 
EREs (see above) were conjugated to the solid-phase 
5 streptavidin, and ER to the ERE. Bound phage were detected 
using horseradish peroxidase-labeled anti- (M13 phage) 
antibodies . 

The ER was then incubated with 100/xl TBST or TEST 
containing 1 of the appropriate modulator. Phage {40//1), 
10 from a 5 hour culture grown in DH5q:F' cells, was added directly 
to the wells and incubated 30 minutes at room temperature. 
Unbound phage were then removed by 5 washes with TBST. Bound 
f3 phage were detected using an anti-M13 antibody coupled to 

'^f horseradish peroxidase (HRP) ...Assays were developed with 2,2' - 

fy 15 azinobis (3 -ethylbenzothiazoline) -6 sulfonic acid (ABTS) and 

hydrogen peroxide for 10 minutes and then stopped by the 
addition of 1% SDS . Absorbance was measured at 405 nm in a 
Molecular Devices microplate reader. 

H The results are shown in Tables 11-13, as follows: 

£3 

fy 20 Table 11, Binding of ERoj-Selected Peptides to ; ERcy 

Receptor; 

\1 Table 12, Binding of ERof-Selected Peptides to ER/3 

Receptor; 

Table 13, Binding of ER^-Selected Peptides to ERa or ER/3 
25 Receptors. 

The binding activity is indicated on a semiquantitative 
scale of 0 to 7+. 

Exampl e 2,4 

Selection and Characterization of Panel Peptides 

3 0 All of the affinity selected phage were evaluated by phage 

ELISA for binding to apo- ERo; and ^S, and to ERof and /3 in the 
presence of estradiol or 4 -OH tamoxifen as described above. 
Many phage showed distinct preferential binding. Some 
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sequences bound more strongly to the apo-receptor , while others 
exhibited preferential binding to the estradiol activated or 
the 4 -OH tamoxifen activated receptor. Based on this analysis, 
eleven phage expressing different peptide sequences and showing 
5 distinct binding preferences, were chosen for further use as 
conformational probes . 

Five of these probes bound to both ER a. and ER (3((y/P I-V) , 
three were specific for ER a {a I-III) , and three were specific 
for ER 13 {(3 I-III) (see Table 10) . One may view this either as 

10 defining a three class panel, with several representatives in 
each class, or as an eleven class panel, with one member per 
class. The identification of distinct classes of peptides, 
some of which recognized both ERof and ERjS, and others that were 
receptor specific is consistent with the primary structures of 

15 the two receptors being similar yet distinct. 

The binding sites of the probes, oi/^ I-V and o; I-III, were 
mapped on ER o: using ERo! ligand binding domain (residues 282- 
595) fused to glutathione-S- transferase (GST) , an Era amino 
terminal domain (1-184) fused to GST, and the full length ER. 

2 0 Assays were conducted using the format described in Example 

2.3, except that the domains were directly immobilized on the 
plastic surface of the well. Assays were conducted as for 
phage ELISA (Ex. 2.3). Results are shown in Fig. 2. 

All of the probes except ot I bound to the ligand binding 
25 domain. The ot I probe, which binds only to the full length 
protein, may be binding to a site that is created by the 
tertiary structure formed by the interaction between receptor 
domains . 

The probes were used to fingerprint the interaction of ER 

3 0 Of and ER jS with a variety of different SERMs by the assay 

method previously described (using ERE) . Next, we evaluated 
the binding of each of the probes to ERcy and ERjS in the 
presence of a variety of ER ligands that have distinct 
biological activities. The goal was to determine if each of 
3 5 the ligands would induce a conformational change in the ER that 
would alter the binding pattern of the probes, thus producing 
a "fingerprint" for each compound. The ligands used for this 
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study include the ER agonists estradiol, estriol, and 
diethylstilbestrol (DES) ; the SERMs 4 -OH tamoxifen, nafoxidine, 
clomiphene, and raloxifene; the antagonist ICI 182,780; and the 
estradiol metabolite 16-a-OK estrone. Premarin, the mixture 
5 of conjugated estrogens used as estrogen replacement therapy, 
was also included, but it should be noted that many of the 
components of Premarin must be metabolically activated. Thus, 
their action may not be detected in this In vitro assay. 
Buffer only (apo-receptor) and progesterone were included as 
10 controls. Information on the structures and biological effects 
of the SERMs used in this study may be found in the following 
papers and reviews: B. S. Katzenellenbogen, M. M. Montano, K. 
r3 Ekena, M. E. Herman, E. M. Mclnerney, Breast Can, Res. Treat. 

^3 44, 23 (1997) ;J. I. Macgregor and V . C. Jordan, Pharma.cologica.1 

fy 15 Rev. 50, 151 (1998); B. T. Zhu and A. H. Conney, 

^-3 Carcinogenesis 19, 1 (1998); M. T. R. Subbiah, Proc . Soc . Exp. 

M Biol. Med. 217, 23 (1998); Sulistiyani, S. J. Adelman, A. 

M Chandrasekaran, J. Jayo, R.W. St. Clair, Arteriosclerosis, 

f Thrombosis, and Vascular Biology IS , 837 (1995); B. R. Bhavnani 

fi 20 and A. Cecutti, J. Clin. Endocrinol. And Metab. 78, 197 

(1994); B. Bhavnani, Proc. Soc. E:Kp. Biol. Med. 217, 6 (1998); 
3 T.A. Grese et al . , Proc. Natl. Acad. Sci . USA 94, 14105 (1997); 

J T. A. Grese et al . , J". Med. Chem. 41, 1272 (1998); A Howell, 

Oncology 11, suppl 1, 59 (1997) . 

25 As shown in Table 14, each of the ligands tested did 

indeed alter the binding pattern of the probes, producing a 
distinct fingerprint for each, whereas the pattern produced by 
progesterone was indistinguishable from that produced by 
buffer . 

3 0 The unique ligand dependent binding patterns of the probes 

indicates that each ligand induces a receptor conformational 
change that exposes different peptide binding surfaces. The 
binding patterns for estradiol and ICI 182,780 are distinct or 
both ERck' and jS, confirming the conformational change 

35 illustrated by the earlier protease digestion studies. The 
protease digestion assay, which relies on the location of 
cleavage sites for detection of conformational changes, could* 
distinguish between conformational changes induced by estradiol 
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and 4 -OH tamoxifen or estradiol and ICI 182,78 0. However, it 
was unable to distinguish between changes induced by 4 -OH 
tamoxifen and other ER modulators such as ICI 182,780. The 
fingerprint assay, however, clearly indicates that unique 
5 peptide binding surfaces are exposed on both ERof and jS in the 
presence of 4 -OH tamoxifen that are not exposed in the presence 
of ICI 182,780. Tamoxifen, nafoxidine and clomiphene contain 
the same triphenylethylene core structure. These three 
compounds, although similar in structure, produce distinct 

10 biological effects. Therefore, it might be predicted that 
these compounds would induce similar, yet distinct , 
conformational changes in the receptors. The fingerprint assay 
shows that the probes a/jS III, IV and V, which have high 
affinity for the ER in the presence of 4 -OH tamoxifen, have 

15 lower affinity for the ER complexed with nafoxidine and 
clomiphene, indicating that these peptide binding surfaces 
differ in the presence of these compounds. The a III probe 
more clearly differentiates these three compounds. The 
fingerprint assay also differentiates 4 -OH tamoxifen and 

20 raloxifene. The probes ot/ (B III, IV and V have reduced affinity 
for both ER o; and (3 in he presence of raloxifene compared to 
4 -OH tamoxifen . The probes of/jS II , jS I and ^ III further 
distinguish ER (B conformational changes induced by these two 
compounds. The fingerprint pattern produced by Premarin is 

25 distinct compared to other agonists; however, Premarin' s 
activities are due to a mixture of components. It would be 
interesting to assess the binding patterns of the probes in the 
presence of each of the purified, activated components of 
Premarin. 

30 The probe ot/^ I contains an LXXLL motif. The binding of 

estradiol to the ER strongly enhanced the binding of this probe 
to both ER ot and ER However, estriol, Premarin and DES, 

which are also considered ER agonists failed to activate the 
binding of this probe to ER of to the same extent as estradiol. 

35 On ER j8, the binding of the probe was enhanced significantly 
with all of the agonists. The SERMs, 4 -OH tamoxifen, 
nafoxidine, clomiphene, raloxifene and ICI 182,780 prevented 
the binding of this probe to both ER o; and j8 and appeared to 



141 

reduce the binding to a level below that which is observed in 
buffer alone. 

The probes a./ ^ III-V show enhanced binding in the presence 
of SERMs, particularly 4-OH tamoxifen, indicating that a new 
5 binding surface is exposed on the ER in the presence of these 
compounds. The binding patterns of these three probes along 
with the probes Of/jS II, a. Ill, jS I and ^ III illustrate 
differences in the receptor conformation induced by 4-OH 
tamoxifen, nafoxidine, clomiphene, and raloxifene. Since the 

10 binding of the probes to the ER in the presence of these SERMs 
may be altered but not abrogated, subtle changes in receptor 
conformation can be visualized. This is the first in vitro 
assay that distinguishes between these four compounds. The 
probe q: II is also unique in that it binds to ER a. in the 

15 presence of any compound that binds to the estrogen receptor, 
indicating that while some receptor conformational changes are 
unique to the modulator, others may be more universal. 
Overall, these probes allow the detection of both subtle and 
distinct conformational changes that are induced by many 

20 different modulators of ER activity. 

To confirm that the binding of the probes to the ER was 
dependent upon the peptide expressed on the surface of the 
phage, biotinylated peptides corresponding to the sequences 
were synthesized with biotin attached to a carboxy- terminal 
25 lysine. The peptides were coupled to europium labeled 
streptavidin and binding studies were conducted using time 
resolved fluorescence spectroscopy (TRF) . 

Time resolved fluorescence (TRF) assays were 
performed at room temperature as follows: Costar high-binding 

30 384 well plates were coated with streptavidin in 0 . 1 M sodium 
bicarbonate and blocked with bovine serum albumin. 
Biotinylated ERE (2 pmol) was added to each well. Following 
a 1 h incubation, biotin was added to check any remaining 
binding sites . The plates were washed and 2 pmol ERo; was added 

35 to each well. Following a Ih incubation, the plates were 
washed and the ER modulators were added at a range of 
concentrations, from picomolar to micromolar. Following a 3 0 
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min incubation with the modulators, 2 pmol of a europium 
labeled streptavidin (Wallace) -biotinylated peptide conjugate 
(prepared as described below) was added and incubated for 1 h. 
The plates were then washed and the europium enhancement 
5 solution was added. Fluorescent readings were obtained with 
a POLARstar fluorimeter (BMG Lab Technologies) using a <400 nm 
excitation filter and a 620 nm emission filter. The europium 
labeled streptavidin-biotinylated peptide conjugate was 
prepared by adding 8 pmol biotinylated peptide to 2 pmol 
10 labeled streptavidin. After incubation on ice for 30 min, the 
remaining biotin binding sites were blocked with biotin prior 
to addition to the ER coated plate. 

The binding of the probes to the ER was measured. The 
results, shown in Table 15, indicate that the peptides are 

15 indeed conferring the binding specificity. Comparison of the 
fluorescence values obtained from the TRF binding assays and 
the signals obtained in the phage ELISA fingerprint indicates 
that the two methods produce sim.ilar patterns. However, the 
binding assay also provides an. indication of the potency of 

20 each compound to induce the conformational change required for 
peptide binding . Taken together, these results indicate that 
conversion of the fingerprint assay from phage to peptides will 
provide an even more sensitive assay for detecting 
conformational change . 

25 One of the most notable observations from the TRF binding 

assays is that the binding of the jS I probe to ER ^ is enhanced 
in the presence of the SERM 4 -OH tamoxifen and reduced in the 
presence of other SERMs such as raloxifene, nafoxidine, and 
clomiphene. The reduction in binding observed with these 

3 0 compounds is similar to the reduction observed with agonists 
such as estradiol, estriol, and DES . 

We have identified peptides that serve as conformational 
probes of the estrogen receptor o; and (3. Many probes bind to 
both receptors, while other probes bind preferentially to 
35 either the o; or (3 receptor. Consistent with the two receptors 
having regions of high homology and other more divergent 
regions, these results indicate that the receptors have some 




binding surfaces in common, while others are unique. The 
implications of this are that both receptors may contact some 
of the same regulatory proteins in the cell, yet there may be 
additional proteins that specifically regulate either ER a or 
5 (3 action. 

We have used our peptidic probes to show that both 
receptors undergo distinct conformational changes as a result 
of binding different ligands. The probes not only reveal 
receptor conformational changes by their relative changes in 

10 affinity, but they also identify unique binding surfaces on the 
two receptors. These binding surfaces may, in fact, be the 
surfaces that interact with various co-regulatory proteins in 
response to different ligands. For example, many peptides 
selected with the estradiol ... activated receptor contained 

15 sequences found in nuclear receptor co-activators, as 
illustrated by the peptides containing the LXXLL motif (Figure 
1) . These peptide probes are probably mimicking, the 

interaction between the receptor and co-act ivating proteins . 
Potentially, these probes can be used to identify heretofore, 

2 0 unknown receptor -protein interactions . 

Additional applications of the probes lie in the area of 
detection of ER modulators. One or more probes can be used to 
set up a high- throughput screen to identify modulators of ER 
activity. We anticipate that compounds that bind to the ER 
25 will alter receptor conformation and hence alter the binding 
patterns of the probes. The sites targeted by the screen may 
not be bona, fide protein-protein interaction surfaces, but may 
represent sites exposed in the presence of a specific ligand, 
and thus serve as markers for specific conformations. The 

3 0 fingerprinting technique may also be applied to quickly 

classify hits from a screen into different categories such as 
agonist (resembling the estrogen pattern) , antagonist 
(resembling the ICI 182,780 pattern), mixed (resembling the 
tamoxifen pattern) or novel effectors, prior to assessing them 
35 in a cell-based assay. Fingerprinting may also be used to 
determine structure activity relationships and to rapidly 
assess compounds following chemical modification during lead 
optimization . 
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This is the first technique described that can distinguish 
between estrogen receptor conformations induced by ligands both 
between and within ligand classes. The data gathered with this 
assay provide strong evidence that the biological activity of 
5 the estrogen receptor can be linked to the conformation induced 
upon binding ligand. A strength of this fingerprinting 
technique is that it is broadly applicable to any protein or 
receptor that undergoes structural changes upon binding of a 
ligand or substrate. 

10 These studies confirm that the assays may readily be 

conducted with synthetic peptides in place of phage-bound and - 
expressed peptides. 



Example 2.5 Analysis of Known SEEMS using Panel 

For fingerprint analysis of estrogen receptor modulators 
15 on ER Of and ER 13, estrogen receptor (3 pmol) was immobilized 
on 2 pmol biotinylated ERE. Immobilized ER was incubated with 
estradiol (1 /iM) , estriol (1 //M) , premarin (10 /xM) , 4-OH 
tamoxifen (1 juM) , nafoxidine (10 ^M) , clomiphene (10 fiM) , 
raloxifene (1 /xM) , ICI 182,780 (1 /xM) , IGof-OH estrone (10 /xM) , 
2 0 DES (1 /xM) or progesterone (1 /xM) for 5 minutes prior to the 
addition of phage. Phage were amplified from plaques in DHBof 
F' for 5 hours. Bound phage were detected as described 
previously. Assays were developed with ABTS (2 , 2 ' -azinobis (3- 
ethylbenzthiazoline-sulf onic acid) for 10 minutes. 

25 The results are shown in Tables 14A and 14B. Table 14A 

shows binding to the ERof receptor and Table 14B binding to the 
ErjS receptor. It is not necessary to list all 11 panel peptides 
in each table since some only bind the ERa and others only the 
ERjS . The binding activity is indicated on a semiquantitative 

30 scale of 0 to 7+. 



Example 2.6 Calculation of Similarity Between SERMs 

Based on Tables 14A and B, one may define a fingerprint 
for each SERM. This fingerprint is an array of descriptors, 
each of which is a value in the Table representing the binding 
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affinity of a particular panel peptide for either ERof or ER^S 
in the presence of the SERM in question. The tables in 
question allow each fingerprint to be composed of 16 
descriptors (one for each row in the tables) . We obtain 12 
5 fingerprints, one for each of the 11 SERMs, plus buffer. 

We can therefore calculate the similarity of between each 
of the 12x12=144 possible pairings of these fingerprints. To 
begin with, we calculate the Euclidean distance between each 
fingerprint. This is the square root of the sum of the square 
10 of the differences between the respective column values in 
Tables 14A and 14B. For example, the distance between buffer 
and Estradiol is the square root of the sum of the square of 
the 16 descriptor pair differences, i.e., the square root of 
the sum of 
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30 for a total of 160, the square root of which is about 

12 . 65 . 
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The maximum possible distance in the present instance is 
the square root of 16* (7*7) , which is 28, This is because each 
descriptor pair has a maximum possible difference of 7, and 
there are 16 descriptors in the fingerprint. 

The distance may be converted into a similarity by 

current similarity= (maximum distance-current distance) 

/maximum distance , 

which equals 1 when the current distance is 0, and 0 when the 
current distance equals the maximum distance. In the example 
above (buff er : estradiol ) the similarity is 0.55. 

In contrast, the fingerprints of cloniphene and 
raloxiphene are at a Euclidean distance of sqrt(40), which is 
6.32, and therefore have a similarity of 0.77. 16q;-OH estrone 
and DES are even more similar, with a Euclidean distance of 
sqrt(7), which is 2.645, and therefore a similarity of 0.91. 

On the other hand, the fingerprints of estradiol and 
cloniphene are at a Euclidean distance of sqrt(210), which is 
14.49. This corresponds to a similarity of 0.48. 

It will be appreciated that we could have changed the 
choice and/or number of descriptors incorporated into the 
fingerprint, rescaled and/or weighted the descriptors in some 
way, used a different measure of distance, and/or converted 
distance into similarity by another method. We could also 
have determined similarity without first calculating a 
distance . 

In the above text, we have lumped together the data for 
ERo! and ERjS . We could have calculated separate fingerprints 
and similarities for each form of ER. This is shown in Figs. 
5 and 6 . 

Thus, for buff er : estradiol , the distance between their 
fingerprints for ERo? binding is SQRT(84) , and for ER^S binding, 
SQRT(76), or 9.16 and 8.72. The maximum distance is SQRT 
(8*7*7), which is 19.8. So the similarities are 0.54 and 0.56 
respectively, which aren't much different. 
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On the other hand, for ICI 182 780:16q?-OH estrone, we 
calculate distances of SQRT(6) for ERof binding, and SQRT(76) 
for ERjS binding, corresponding to similarities of 0.88 and 
0.56, respectively. So these compounds are more similar in how 
5 they bind ERof than in how they bind ERjS . 

This fingerprinting technique provides a rapid and 
sensitive method to detect changes in protein conformation. 
We have applied this technique to ER a. and jS and demonstrated 
that these two receptors undergo different conformational 

10 shifts in response to various modulators of activity. Because 
the pattern of probe binding is unique for each modulator, the 
assay can be used to distinguish compounds both between and 
within modulator classes. The assay can also be used to 
identify modulators that have specif icity for either. the a or 

15 jS form of the receptor. 

One or more probes can be used to set up a high- throughput 
screen (HTS) to identify modulators of ER activity. Compounds 
that bind to the ER and alter receptor conformation will alter 
the binding patterns of the probes. This technique may also 

20 be applied to classify hits from a HTS as agonist (resembling 
the estrogen pattern) antagonist (resembling the ICI 182,780 
pattern) or mixed (resembling the tamoxifen pattern) prior to 
assessing them in a cell-based assay. Fingerprinting may also 
be used for structure activity relationships. As chemical 

25 modifications are made to lead molecules, fingerprinting will 
provide a convenient method to quickly determine if the 
modification affects receptor conformation in a manner 
different than the parent compound. 

All of the compounds used in this study are known to 
3 0 produce unique biological effects in vivo. Many of the 
differential effects are tissue specific, perhaps due to 
differential expression of regulatory proteins and/or the two 
forms of the receptor. Each of these compounds also produces 
a unique fingerprint pattern in vitro, derived from the 
3 5 conformation adopted by the receptors upon binding the 
modulator. Thus, fingerprinting conformational changes induced 
by SERMs in vitro is expected to be useful for predicting the 
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in vivo biological activities of modulators. 

E:Ka.mple 3: Fingerprinting Using Yeast Two-Hybrid Cell-Based 
Assays 

5 The two hybrid methods of examining protein/protein 

interactions initially described by Fields and Song (Nature 
340:245-246 (1989)) and later by Gyrius , et al (Cell 75:791-803 
(1993)) utilize similar technologies. In both cases a yeast 
cell is provided as the host cell which carries a reporter gene 

10 operated by an upstream protein binding site (DNA binding 
site) . The host cells carry a plasmid expressing 

peptide/protein fusions with the specific binding protein or 
domain (DNA binding domain) . The host cell also carries a 
plasmid expressing a peptide/protein fusion with a 

15 transcriptional activation protein or domain (Activation 
domain) . If the two peptide/protein fusions are capable of 
directed interactions within the cell, transcriptional 
activation of the reporter gene occurs. The level of reporter 
gene transcription is reflective of the strength of the 

20 interaction between the two protein fusions. 

The LexA system that we employ utilizes a bacterial DNA 
binding protein domain, LexA, and a bacterially derived 
transcriptional activation sequence, B42 . Proteins or peptides 
of interest are fused in frame with these domains and expressed 

25 using episomal plasmids in a yeast cell. The interactions 
between these proteins/peptides of interest are registered by 
monitoring the level of the reporter gene product, (3- 
galactosidase , by an enzymatic assay. The differences in the 
levels of j8-galactosidase activities reflect the relative 

30 strengths of the protein interactions . 

We have tested the interactions of peptides F6 (an 
affinity-selected peptide with a high affinity for ERof) , alpha2 
(A2) , alpha/beta 3 (AB3) , and alpha/beta 5 (AB5) with estrogen 
receptor a using the LexA yeast two hybrid system in the 
35 presence of agonist or antagonist. These peptides were 
isolated previously from phage display libraries using estrogen 
receptor of (ERof) as a target . The interactions between these 
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peptides and ERo; are altered in the presence of agonist or 
antagonist in the in vitro phage display system. For example, 
peptide Qf2 was found to bind in the presence of estradiol and 
4 -OH-tamoxif en, but not in their absence; peptide a/^ 3 binds 
5 to ERof only in the presence of 4 -OH- tamoxifen, not estradiol 
or in the absence of any compound. We undertook the yeast two 
hybrid analysis to investigate whether these in vitro results 
could be recapitulated in vivo. The results from the yeast two 
hybrid system were qualitatively similar to those that were 
10 performed using phage display on purified ERof protein. 

fi Yeast strains and genetic manipulations 

References for plasmids and strain 
Cloning vector pJG4-5 

Genbank Accession number: U89961 

iy 

M 15 Reference: Gyuris,J., Golemis,E., Chertkov,H. and Brent, R. 

Cdil, a human Gl and S phase protein phosphatase that^ 
h associates with Cdk2 . Cell 75 (4), 791-803 (1993) 



Cloning vector pEG2 02 (pLexA) , complete sequence. 

Genbank Accession number: U89960 

2 0 AUTHORS Golemis,E., Gyuris,J. and Brent, R. 

TITLE Interaction trap/two-hybrid systems to identify 

interacting proteins 

JOURNAL Unpublished 



pJK103 

25 Reference: J. Kamens and R. Brent A yeast transcription 

assay defines distinct rel and dorsa.1 DNA recognition 
sequences. New Biol. 3:1005-1013 (1991). 
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Yeast Strain EGY4 8 

Reference: Gyuris,J., Golemis,E., Chertkov,H. and Brent, R. 
Cdil, a human Gl and S phase protein phosphatase that 
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associates with Cdk2 . Cell 75 (4), 791-803 (1993) 

The yeast strains used in this study was EGY4 8 ( MAT 
Of trpl his3 ura3 leu2 : : 6lexAops-LEU2 ) purchased from 
OriGeneTechnologies for yeast two hybrid analysis. This strain 
5 contains 6 LexA operators upstream of the LEU2 gene in the 
yeast genome and provides high sensitivity in detecting 
protein-protein interactions in the LexA two hybrid system. 
The plasmids used in this study were pEG2 02 (LexA-DNA binding 
domain) , pJG4-5 (B42 - activation domain) , and the plasmid 
10 containing a jS-galactosidase reporter, pJK103 (OriGene 
. Technologies) . The full length estrogen receptor was subcloned 
in frame into the EcoRI and Xhol sites of pJG4-5 to generate 
an ERQf-B42 activation domain fusion. ERof was subcloned in the 
activation domain plasmid because ERa was able to autoactivate 
15 reporters when fused to the LexA DNA binding domain. 

The peptide sequences used in this study were generated 
from synthetic oligos filled in by T7 Sequenase (Life 
Science) and subcloned into the EcoRI-XhoI sites of pEG202. 
The synthetic oligos were: 

20 F6, 5'- 

GACTGTGCGAATTCGGTCATGAACCATTAACTTTATTAGAAAGATTATTAATGGATGATA 
AACAA.GCTGTTCTCGAGCGTGTCAG ; 

q;II, 5'- 

GACTGTGCGAATTCTCTTCTTTAACTTCTAGAGATTTTGGTTCTTGGTATGCTTCTAGAC 

2 5 TCGAGCGTGTCAG ; 

Qf/iSlII, 5'- 

GACTGTGCGAATTCTCTTCTTGGGATATGCATCAATTTTTTTGGGAAGGTGTTTCTAGAC 
TCGAGCGTGTCAG ; 

Qf/j8V, 5'- 

3 0 GACTGTGCGAATTCTCTTCTCCAGGTTCTAGAGAATGGTTTAAAGATATGTTATCTAGAC 

TCGAGCGTGTCAG . 

The complementary synthetic oligo used to generate double 
stranded DNA was 3'XhoPrim, 5 ' -CTGACACGCTCGAG . Each 5' -oligo 
was annealed to the 3' -oligo by heating to 90 °C for 15 minutes 
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and cooled slowly to 35 °C. T7 sequenase was added and the 
fill-in reaction allowed to proceed at 30 °C for 30 minutes. 
The reaction was terminated by heat denaturing the enzyme at 
65 for 1 hour, restriction digests were performed and the 
5 resulting DNA fragments subcloned into pEG202 to generate 
peptide-LexA DNA binding domain fusions. 

Yeast cells were transformed by the method of Ito et 
al. (J. Bacterid. 153: 163-168 (1983)) and grown on selective 
media . 

10 jS-galactosidase activity assays 

10 ml cultures of yeast strain EGY48 containing p JG4 - 
5 ERckf pJK103 and pEG202-F6, -ofll, -Qi/(2J.ll, or -of/jS V were grown 
overnight at 30 °C in selective media containing 100 nM 
estradiol, 4 -OH tamoxifen, or tamoxifen citrate with galactose 
15 as the carbon source. The culture was diluted to -2x10^ 
cells/ml in the same media and allowed to grow at 30 °C until 
the cultures reached a density of -1x10^ cells/ml (-4 hours) . 
The yeast cells were pelleted by centurif ugation, washed with 
extraction buffer (60 mM NasHPO^, 4 0 mMNaH2P04, 10 mM KCl , 1 mM 

2 0 MgS04, ImM PMSF, 7 mM 2 -mercaptoethanol ) and suspended in 2 00 

^il of extraction buffer. 100 /xl of acid-washed glass beads 
were added and cells were lysed by vigorous agitation for 10 
minutes at 4 °C. Cellular debris was pelleted by 

centrif ugation and the supernatant transferred to a clean tube. 
25 10 /xg of total cellular protein was diluted into complete Z 
buffer (60 mM Na2HP04 , 4 0 mMNaHsPO^, 10 mM KCl, 1 mM MgSO^, 7 mM 
2 -mercaptoethanol ) to a volume of 100 /xl in a 96 -well 
microplate. 80 /xg of o-nitrophenyl -/S-D-pyranoside (ONPG) in 
20 /xl was added to each well to initiate color development. 

3 0 The reaction was stopped by the addition of 3 0 /xl IM NasCOa and 

the time for development was noted. /3-galactosidase activity 
was determined by measuring the absorbance at 4 05 nm. 

Yeast cultures were grown in the presence or absence of 
lOOnM estradiol, 4 -hydroxy- tamoxifen , or tamoxifen citrate and 
35 protein extracts prepared as described in methods. 10/xg of 
each protein extract was assayed for jS-galactosidase activity 
using o-nitrophenyl-b-D-pyranoside as substrate. Activity 
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units are defied as 1000* Abs405/minutes/mg/protein . 
The results are shown in Table 99. 

The peptides that were used in the two hybrid assay were 
originally isolated by phage display using ER alpha as the 
5 target protein. The isolation procedure was carried out in the 
presence of agonists or antagonists (estradiol, 4 -hydroxy 
tamoxifen,...) which generated a differing set of interacting 
peptides. The interactions of these peptides with ER alpha 
were investigated in vitro using different agonists and 
10 antagonists. The interaction profile generated by these in 
vitro studies allows us to use these peptides as probes for the 
physical state of the estrogen receptor. The two hybrid assay 
discerns whether these interactions can be maintained within 
the cell. 

15 The results from the two hybrid experiments show a 

qualitatively similar interaction profile between the peptides 
and ER alpha as determined in vitro. Therefore, the effects 
of agonists and antagonists on the structure and availability 
of peptide binding sites on ER alpha is maintained in vitro and 

20 in vivo. These results allow interpretation of the structural 
state (activated or antagonized) of ER alpha in response to 
various compounds. The results using known activators and 
antagonists can be used to identify other unkov/n compounds as* 
agonists or antagonists in a drug screen. The availability of 

25 more peptides and use of other known agonists and antagonists 
will generate better tools for identifying possible compounds 
or drug leads . 
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Example 4 Use of Ma.mina.lian Two-Hybrid Assays to Explore ER 
Activation 

The estrogen receptor (ER) plays an important role in both 
normal and pathological processes of human development and 
5 disease. Clinically, ER antagonists such as tamoxifen have met 
with much success in the treatment of ER containing breast 
cancers. However, resistance to tamoxifen usually develops 
within 2-5 years after initial treatment. A potential 
mechanism of resistance may be the ability of tumors to switch 
10 from recognizing tamoxifen as an antagonist to responding to 
it as an agonist . 

In this regard, several new tissue specific antiestrogens 
"5 have been developed which may have clinical utility in the 

treatment of tamoxifen refractox^y breast cancers. 

-.J 15 In an attempt to identify novel high affinity ligands 

J=-i which target the ER/tamoxifen complex, we have employed the use 

il of phage display to screen for random peptides which will 

- recognize the specific ER conformation induced by tamoxifen.' 

We have isolated a series of 15mer peptides which can recognize 
ry 20 this complex . Furthermore , these peptides are able to form 

^2 complexes in vivo with ER as assessed in the mammalian two- 

Q hybrid system. Using various ER mutants, we have mapped the 

peptide interaction surface to the hormone binding domain! 
Importantly, we have demonstrated that expression of these 
25 peptides can block the partial agonist activity of tamoxifen 
in cells transfected with ER. Although the mechanism by which 
these peptides block ER/tamoxifen transcriptional activity 
remains unknown, it appears that DNA binding and ER stability 
are not affected by peptide expression. Therefore, it is 
30 possible that these peptides may be targeting a functionally 
active site present in the ER/tamoxifen conformation. 

This makes possible a novel approach in the development 
of rational drug design for the treatment of tamoxifen 
refractory breast cancers. Traditionally, only molecules which 
35 interact with the ligand binding pocket have been considered 
for the development of novel ER antagonists. In addition to 
these traditional "hormonal" agents, we propose that the 
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ability to target the specific receptor conformation induced 
by hormones will result in the development of therapeutically 
important novel pharmaceuticals. Furthermore, these findings 
may be applied to other nuclear receptors for which 
5 transcriptional interference may be clinically useful. 

Exampl e 4.1 

Figure 7 shows the development of a cell -based assay to 
assess peptide-receptor interactions. Peptide sequences 
representing each class was fused to the DNA binding domain 

10 (DBD) of the yeast transcription factor Gal4 . HepG2 cells were 
then transiently transfected with expression vectors for ERa- 
VP16 and the Gal4 -peptide fusion proteins. In addition, a 
luciferase reporter construct under the control of 5 copies of 
a Gal4 upstream enhancer element was also transfected along 

15 with a pCMV-jS galactosidase vector to normalize for 
transfection efficiency. Transfection of the Gal4 DBD alone 
is included as control. Cells were then induced with various 
ligands as indicated in the figure and assayed for luciferase 
activity and jS galactosidase activity. Normalized response was 

20 obtained by dividing the luciferase activity by the jS 
galactosidase activity. 

Results. ERo! does not interact with the Gal4 DBD alone 
under any condition. a^l interacts with ER in the presence of 
estradiol and somewhat with the apo- receptor . or II interacts 

25 with the receptor under all conditions with the apo-receptor 
and ICI 182,780 bound receptor showing the least activity. 
III and QfjS V interact almost exclusively with the tamoxifen 
bound receptor. This data in general confirms that obtained 
from the time resolved fluorescent study. Furthermore, the 

30 ability of these peptides to act as conformational detectors 
confirms in the cell earlier observations obtained from 
protease digestion and crystallization studies that the 
receptor undergoes distinct conformational changes when bound 
by different ligands. 

3 5 EKa.mpl e 4,2 
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The specificity of pept ide-nuclear receptor interaction 
was analyzed (Fig. 8) using the mammalian two-hybrid system. 
Experimental design is the same as in figure 2 except that 
either progesterone receptor (PRB-VP16) , estrogen receptor beta 
5 (ERjS-VPlG) , glucocorticoid receptor (GR-VP16) or thyroid 
receptor beta (TRjS-VPlG) was tested. 

Results. All receptors tested, as expected, interact with 
the ot^ I peptide in the presence of the appropriate agonist for 
that receptor. None of the receptors tested interact 

10 significantly with the ot II or ot(3 III peptide. This was 
somewhat surprising considering that ofjS III was originally 
isolated on ERjS. This suggests that the conformation of ER/8 
in the cell may be different from that of the purified receptor 
in vitro. Interestingly, ot^ V was able to associate with both 

15 PRB and ER|S . This peptide bound ER/8 only in the presence of 
tamoxifen but was able to associate with PRB in the presence 
of the PR antagonists RU 486 and ZK 98299. This suggest that 
otp V is capable of recognizing the antagonist conformation of 
a subset of nuclear receptor family members. 

2 0 Exampl e 4.3 

Figure 9 demonstrates that certain peptides which interact 
with the tamoxifen activated estrogen receptor do not require 
AF-2 (Helix 12) of the receptor. Three ERa mutants were 
compared with wild- type ERa. ER-LL was characterized by 

25 mutations L540A/L541A. Mutant ER3X was characterized by 
mutations D538N/E542Q/D545N. Finally, mutant ER-535 STOP was 
truncated after residue V535. These mutations have been shown 
to significantly compromise ER AF-2 transcriptional activity 
and their interaction with several known coactivators . Mutant 

30 ER3X is partially AF-2 active and ERLL and ER535-stop are AF-2 
inactive. Experimental design is the same as in figure 7 
except that either (A) ER3X (B) ERLL or (C) ER535-stop was 
analyzed for binding to conformation sensitive peptides. 

Results. a(3 I peptide is unable to interact with the ERLL 
35 and ER535-stop mutant receptors in the presence of estradiol 
indicating that these mutations may abolish coactivator 
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binding. ofjS I peptide retains some ability to engage the ER3X 
mutant receptor in the presence of estradiol suggesting that 
these mutations significantly lower the affinity of the 
receptor for coactivators but does not destroy this 
5 interaction. These findings are consistent with the 

transcriptional properties of these receptors. a II peptide 
binding specificity is largely unaffected by any of the 
receptor mutations tested. Interestingly, the a(3 111 and a(3 
V peptides specificity of interaction is modified with each 

10 successive mutation resulting in a loss of the tamoxifen 
specificity and resulting in the ability of these peptides to 
engage the receptor in the presence of many of the ligands 
tested. These results suggest that although helix 12 is not 
required for the binding of peptides which recognize the 

15 conformation induced by tamoxifen that normal helix 12 
structure is required for the specificity of interaction of a(3 
III and al3 V peptides. 

Exa^mpl e 4.4 

Figure 10 studies the disruption of ER mediated 
20 transcriptional activity by Gal4-peptide fusion proteins. 
HepG2 cells were transfected with the estrogen responsive C3 - 
Luc reporter gene along with expression vectors for ERce and p 
galactosidase . Cells were induced with either estradiol or 
tamoxifen as indicated in the figure and analyzed for 
25 luciferase and jS galactosidase activity (lOA) . 

Then HepG2 cells were transfected as above except that 
expression vectors for Gal4 -peptide fusions were included as 
indicated in lOB. Control represents the transcriptional 
activity of estradiol (10 nM) activated ER in the presence of 

30 the Gal4- DBD alone and is set at 100% activity. Increasing 
amounts of input plasmid for each Gal4 -peptide fusion is also 
shown with the resulting transcriptional activity presented as 
% activation of control. Data is averaged from three 
independent experiments (each performed in triplicate) with 

35 error bars representing standard error of the mean, subfig. C 
is same as in (B) except that 4 -OH tamoxifen was used to 
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activate the receptor. 

Results. Tamoxifen displays partial agonist activity in 
HepG2 cells. This activity is up to 30% of that exhibited by 
estrogen. ot^ I and a. II peptides are able to inhibit the 
5 ability of estradiol to activate transcription up to 50% under 
the conditions of this assay. It is not surprising that the 
QfjS I peptide inhibits ER activity due to the fact that it 
probably competes for coactivator binding. The ability of ot 
II peptide to disrupt ER transcriptional activity may suggest 

10 that this peptide recognizes some pocket in the receptor that 
is also important for coactivator binding. The inability of 
QfjS III and QfjS V to block estradiol mediated transcription 
correlates well with their inability to bind the receptor when 
bound by estradiol. Interestingly, a II, oi(i III and or/S V are 

15 able to efficiently block the partial agonist activity of 
tamoxifen while ofjS I is not. These findings are in agreement 
with the binding characteristics of these peptides and may 
suggest that the pocket (s) recognized by these peptides are 
important for the ability of tamoxifen to behave as a partial 

20 agonist . 

Example 4 . 5 

Figure 11 shows disruption of tamoxifen activated ER 
transcriptional activity by ofll peptide is not promoter 
dependent. Experimental design is the same as in figure 6 
25 except that the ability of q:II peptide to inhibit tamoxifen 
(lOnM) activated transcription was tested on several distinct 
promoters including Ix-ERE-Luc, 3X-ERE-Luc, TK-ERE-Luc and C3 - 
Luc . 

Results. The ability of ot II peptide to block tamoxifen 
30 activated transcription is not dependent on the context of the 
promoter. This peptide blocks tamoxifen partial agonist 
activity from all promoters tested. 

Exampl e 4.6 

Figure 12 shows disruption of ER mediated transcriptional 
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activity through the AP-1 pathway by Gal4 -peptide fusion 
proteins. (A) HepG2 cells were transfected with the AP-1 
responsive collagenase reporter gene construct (pCOL-Luc) and 
expression vectors for ERor and jS-galactosidase . Cells were 
5 then induced with either estradiol or tamoxifen as indicated 
in the figure and assayed for luciferase and )S-galactosidase 
activity and normalized as detailed in figure 7. (B) Same as 
(A) except that Gal4 -peptide fusion constructs were also 
transfected as indicated in the figure. Control represents the 

10 transcriptional activity of either estradiol or tamoxifen 
(lOOnM) activated ER in the presence of the Gal4 DBD alone and 
is set at 100% activity. The transcriptional activity of 
estradiol and tamoxifen is shown in the presence of each Gal4- 
peptide fusion with the resulting transcriptional activity 

15 presented as % activation of control. Data presented is from 
a single representative experiment . 

Results. Both estradiol and tamoxifen are able to 
activate transcription from the AP-1 responsive collegenase 
reporter gene. This activity is manifest in the absence of an 

2 0 estrogen response element (ERE) and is believed to occur 

through some mechanism involving an interation between ER and 
the AP-1 proteins Fos and Jun. As with the C3-Luc reporter 
gene, each peptide is able to inhibit ER mediated 
transcriptional activity according to its ability to interact 
25 with the receptor in a ligand dependent manner. Those peptides 
which interact with the estradiol bound receptor inhibit 
estradiol mediated transcription while those which interact 
with the tamoxifen bound receptor inhibit tamoxifen mediated 
transcription . 

3 0 EDcampl e 4.7 

Figure 13 presents a model of the potential mechanisms by 
which peptides block the partial agonist activity of tamoxifen. 
(A) Model of the activation pathway by whioh tamoxifen exhibits 
partial agonist activity. Upon binding tamoxifen (T) , the 

3 5 receptor undergoes a conformational change which allows it to 

interact with soma as yet unidentified coactivator protein. 
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This protein in turn transmits a signal to the general 
transcription machinery which results in activation of 
transcription. (B) In this model of inhibition, the receptor 
undergoes a conformational change when bound by tamoxifen but 
the coactivator protein is unable to engage the receptor due 
to competition for the same site by the peptide. (C) In this 
model of inhibition, the receptor undergoes a conformation 
change in the presence of tamoxifen which results in the 
formation of distinct pockets on the receptor. One pocket 
which is distal to the coactivator binding site interacts with 
the peptide. As a result of this interaction, an additional 
conformational change occurs precluding the interaction between 
the coactivator and the receptor. 



Exa.mpl e 5 

Figure 20 shows a similarity analysis of the data pictured 
in Figure 7. Each ligand has a five element footprint, the 
elements corresponding to the normalized transcriptional 
response which it induced in a mammalian two-hybrid system 
presenting either the apo-receptor (control) or the receptor 
in the presence of one of the peptides ajSl, a2 , a(33 or ofjSS . 
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Example 101 

One of the distal steps in transcriptional activation by 
estrogen receptor (ER) is the recruitment by ligand-bound 
receptor of one of a number of coactivator proteins. This 
5 activity permits ER to interact with the general transcription 
machinery and exert its regulatory actions on target gene 
promoters. It has now emerged that one effect of agonist 
binding is to induce a conformational change within ER, 
permitting the interaction of ER helices 3 and 12, and the 

10 subsequent formation of a pocket which allows the coactivator 
proteins to dock. These observations suggest that receptor 
antagonists inhibit ER transcriptional activity by affecting 
the formation of the coactivator binding pocket and reducing 
the affinity of ER for coactivators . Although an ER-specific 

15 coactivator protein remains to be identified, several 
coastivaators have been identified which potentiate the 
transcriptional activity of ER and other members of the steroid 
receptor superfamily. Furthermore, the finding that these 
coactivators use a highly conserved LXXLL motif to interact 

20 with the receptors made it uncertain as to whether receptor- 
cofactor interactions were determined by simple competition or 
if there was some specificity built into the system. 

In order to address these possibilities, we undertook a 
molecular approach to dissect the LXXLL-ER interaction and to 

25 evaluate the role of flanking sequences in influencing these 
interactions. We utilized phage display technology to screen 
10 X 10*^ variations of the core LXXLL motif. Using estradiol- 
activated ER as a target, we identified a number of phage which 
encoded high affinity ER-interacting peptides. Using the 

3 0 sequence information derived from these phage, we constructed 
a series of GAL4 -peptide fusions and assessed their ability to 
interact with ERof, ER/3, GR and PR using a two-hybrid assay in 
mammalian cells. The results of this assay confirmed that the 
LXXLL motif was permissive for nuclear receptor binding but it 

35 also revealed that sequences flanking this motif were important 
determinants of specificity. Thus, as expected, not all LXXLL 
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motifs are the same. This suggests that within a cell, 
specificity and not just mass action influences the ability of 
a nuclear receptor to find a required cof actor. In an effort 
to understand the mechanism underlying this observed 
5 specificity, we assayed the ability of these peptide fusions 
to interact with a series of ER helix-12 mutants. Using this 
approach we noticed that mutation of the conserved hydrophobic 
residues in this helix abolished ER-AF-2 function and blocked 
the interaction of all LXXLL peptides with ER. Disruption of 

10 helix 12 by mutating the three conserved charged residues 
(D538N/E542Q/D545N) prevented most peptides from binding and 
also abolished AF-2 function. However, a large number of the 
LXXLL- containing peptides studied were unaffected by this 
manipulation. This is an important observation since the 

15 latter mutation also blocks the interaction of ER with GRIP-1 
and SRC-1. Cumulatively, our data indicate that the steroid 
receptors display distinct preferences for different classes 
of LXXLL motifs, suggesting a molecular basis for cof actor- 
receptor specificity. Importantly, however, they also indicate 

20 that AF-2 function and coactivator binding are not synonymous, 
a result which indicates that there are likely to be additional 
cof actors distinct from SRC-1 and GRIP-1 which remain to be 
discovered. 

Plasmids : All the Gal4DBD-peptide fusions were 

25 constructed as follows: DNA sequences code for the peptides 
were excised from mBAX vector with Xhol and Xbal restriction 
enzymes and subcloned into pMsx vector derived from pM vector 
(Clontech) with a linker sequence to generate infram Sail and 
Nhel sites for cloning. The fusion constructs expressing two 
30 copies of LXXLL motifs, 2xF6 and 2x293, were derived from their 
corresponding single-copy peptide-DBD fusion plasmids by adding 
a linker sequence (adapted from the sequences found in-between 
the GRIP-1 NR box 2 and box 3) . Subsequently, a second copy 
of the LXXLL peptide was added, resulting in the two copies of 
35 LXXLL motifs being separated by 50 amino acids, the same 
spacing found in-between the GRIP-1 NR box 2 and 3. VP16ER-Qf 
construct was generated by polymerase chain reaction (PGR) of 
full length human ER-cx cDNA with primers containing ECoRI 




162 



flanking both 5' and 3' ends. The PGR product was then 
subcloned into pVP16 vector (Clontech) to generate the VPlS-ERo; 
fusion with VP16 located at the N-terminus of ERof cDNA. 
Plasmids pVP16ER-i8, pVPlS-RARo;, and pVPlG-RXRof were generated 
5 in a similar fashion. pVP16VDR is a generous gift from J. W. 
Pike (University of Cincinnati, Cincinnati, OH) ; VP16TRj8 
expression plasmid (pCMX-VP-F-hTR/S) was provided by D.D. More 
(Baylor College of Medicine, Houston, TX) ; VPlGGr, VP16PR-a, 
VP16PR-b, and VP16AR were gifts from J. Miner (GR) , D.X. Wen 

10 (PR-a and PR-b) , and K. Marschke (AR) (Ligand Pharmaceuticals, 
San Diego, CA) . VP16-ER mutant constructs were generated by 
excision of mutant ER cDNAs from ER expression plasmids (ER- 
TAFl, ER-LL and ER-535 stop plasmids, Tzukerman et al . Mol . 
Endocrinol. 1994 ( 8 ): 21 -30 and Norris et al . , J. Biol. Chem. 

15 (273) : 6679-6688 , 1998) and subcloned into pVP16 vector. 
Mammalian expression plasmids for ERck!, Erj8, and ER179C, as well 
as 3xERE- TATA- Luc receptor construct were described elsewhere 
(Tzukerman et al . Mol. Endocrinol. 1994 (8) : 21-30) . 5xGal4Luc3 
construct was modified from 5xGal4 -TATALuc plasmid (a gift from 

20 X.F. Wang, Duke University, Durham, NC) where the lucif erase 
gene was replaced by a modified version of luciferase cDNA from 
pGL3 basic vector (Promega) . GRIP-1 and SRC-1 constructs were 
generated by subcloning PCR products corresponding to GRIP-1 
a. a. 629-760 and SRC-1 a. a. 621-765 inco pM vector (P.H. 

25 Giangrande, unpublished) . All PCR products were sequenced to 
ensure the fidelity of the resultant constructs. 



Example 101.1 

Construction of Phage Library. A focused random peptide 
library (X7-LXXLL-X7 X=any AA, L=Leu) was constructed and 

30 displayed on M13 phage using the M13 phage-based cloning vector 
mBAX, see Tora, et al . , Cell, 59:477-87 (1989) . The top strand 
oligonucleotide 5'- 
AGTGTGTGCCTCGAGA (NNK) 7CTG (NNK) 2CTGCTG (NNK) ^TCTAGACTGTGCAGT- 3 ' 
(N=A, C, G, or T; K= C or T) was purchased from Life 

35 Technologies, gel purified and annealed to its complementary 
strand oligonucleotide 5'- ACTGCACAGTCTAGA-3 ' . The resulting 
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DNA complex was extended by Klenow polymerase in the presence 
of dNTPs to generate double- stranded DNA, and was subsequently 
digested with Xho I and Xba I and ligated into mBAX vector, 
previously digested with the same restriction enzymes. The 
5 ligated products were electroporated into E. coli JS-5 cells 
and amplified on 2YT plates for 6 h to create the (X) 7LXXLL (X) ^ 
peptide library. The amplified phage were then eluted from the 
plates by phosphate buffered saline (PBS) , concentrated, and 
finally re-suspended in 20% glycerol/PBS and stored in -70*^C 
10 in 500 /il aliquots. The library has a complexity of 1.5 x 10® 
different peptide sequences. 

Affinity Selection. Baculovirus expressed full length ER- 
Q? was provided by PanVera Corporation (Madison, WI) . 
Approximately 0.25 fig (4 pmole) of ERcv was diluted in lOOjUl of 
15 NaHCOa (pH=8.5) plus 10'^ M IV/S-estradiol , applied to a single 
well in a 96 -well Immulon 4 plate (Dynex Technologies, Inc.) 
and incubated at room temperature for 3 h. An equal amount of 
BSA was added to the adjacent well as a control target. The 
wells were blocked with 150 fil of 0.1% BSA in NaHC03 for an 

2 0 additional hour at room temperature, and washed 5 times with 

PBST (13 7 mM NaCl , 2 . 7 mM KCl , 4.3 mM Na2HP04 , 1.4 mM KH2PO4, 
pH=7.3 and 0.1% Tween 20) to remove excess protein. Twenty 
five microliters of the phage peptide library (with > 10^° 
phage particles) diluted in 125 /il PBST with 10*^ M 17/8- 

25 estradiol and 0.1% BSA was then added to the wells, and the 
plate was sealed and incubated for 8 h at room temperature . 
Non-binding phage were removed by washing the wells 5 times 
with PBST, The bound phage were eluted by 100 fxl pre-warmed 
(50^0 50 mM glycine-HCl (pH=2.0), followed by 100 ixl of 100 

30 mM ethanolamine (pH=11.0) . The first eluent was neutralized 
by adding 200 /xl of 200 mM Na2HP04 (pH=8.5) and combined with 
the second eluent. Phage eluted from the targets were 
amplified in E. coli DHSceF' cells for 8 h, and the supernatant 
containing amplified phage was collected for use in subsequent 

3 5 rounds of panning. A total of three rounds of panning were 

performed. Enrichment of ER binding phage was confirmed by 
ELISA as described below. Individual phage were plaque 
purified after the third panning and the peptide sequences were 
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deduced by DNA sequencing. 

ELISA. Phage binding characteristics were examined by 
ELISA. Full-length ERo! (0.4 pmole per well) was activated by 
different ER-ligands and coated on 96-well Immulon 4 plates as 
5 described above. Fifty microliters of phage stock was applied 
to the wells and incubated with the targets for 1 h at room 
temperature. Unbound phage were removed by 5 washes of PBST . 
A 1:5000 dilution of horseradish peroxidase-conjugated anti-M13 
antibody (Amersham) /PBST was added to the wells and this was 

10 incubated for 1 h at room temperature, followed by 5 washes of 
PBST. Bound ant ibody- enzyme conjugate was detected by ABTS 
(2', 2 ' -azino-bis-3 -ethylbenzthiazoline-6-sulf onic acid) in the 
presence of 0.05% H2O2, and the color change was measured at 
4 05 nm on a plate reader (Multiskan MS, Labsystems) . Phage 

15 that bound to ER only in the presence of estradiol were 
selected and the peptide sequences were deduced by DNA 
sequencing (Table 101) . 

The LXXLL motif -containing peptides are major binding 
species in the affinity selection when estradiol activated ERor 
20 was used as a target. ER4 (Table 101) binds to agonist 
occupied ER but not partial agonist- or antagonist-occupied ER . 

The transcriptional activity of ER within target cells is 
influenced by its ability to interact with specific factors 
that decrease (co-repressors) or increase (co-activators) its 

25 transcriptional activity. Over the past few years, the 
application of various molecular biology approaches have led 
to the discovery of co-activators that interact with the 
nuclear receptor hormone binding domain (HBD) through a 
conserved LXXLL motif in a ligand-dependent manner. In this 

3 0 study, we used a combinatorial phage display approach to 
determine how flanking sequences influence the LXXLL 
motif /receptor interactions. The advantages of using this 
approach are twofold: a vast number of sequences can easily be 
assessed and, more importantly, sequences obtained from this 

35 type of screening often reflect sequences that can be found in 
nature. Specifically, a 19-mer phage "focused" library was 
constructed in which the LXXLL motif was flanked on each side 
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by seven random amino acid residues. 



The resulting phage 



library was used to select for peptides that bound with high 
affinity to estradiol-activated ERof. Phage particles that 
bound specifically to ERo; in a ligand- dependent manner were 
5 selected, amplified, and the amino acid sequences were deduced 



peptide sequences derived from the isolated phage. Based on 
sequences flanking the core LXXLL motif, three different 
sequence clusters have emerged. Class I peptides contain a 

10 conserved serine at the -2 position and a positively-charged 
residue (R) at the -1 position. Class II peptides have a 
proline occupying the -2 position and a hydrophobic leucine (L) 
residue directly preceding the LXXLL motif. Two out of the 
three peptides in class II also contain a charged histidine (H) 

15 at the -3 position, which appears to have an influence on their 
binding characteristics (see discussion) . Class III peptides 
share a conserved serine (S) or threonine (T) at the -2 
position followed by a hydrophobic leucine (L) or isoleucine 
(I) at the -1 position. In these initial characterizations, 

20 we utilized the intact bacteriophage to evaluate the ERof 
binding properties of these peptide sequences. In order to 
show that the peptide alone is both necessary and sufficient 
for ER- binding, we subcloned representative members of each 
class of peptides as fusion proteins to bacterial alkaline 

25 phosphatase, and demonstrated that the purified recombinant 
peptide -enzyme fusions interacted specifically with ERo:. 

Example 101.2 

We next developed a series of mammalian two-hybrid assays 
to confirm that the LXXLL -containing peptides identified could 

3 0 interact with ERof in the context of the intact cell. For this 
purpose, full-length ERo? was expressed as a fusion protein to 
the VP16 acidic activation domain, and the peptide sequences 
were produced as fusions to the yeast Gal4 DNA binding domain 
{Gal4DBD) . Interaction between ERckf-VP16 and the LXXLL-Gal4DBD 

35 fusions was assessed using the 5xGal4Luc3 luciferase reporter 
gene that contains five copies of Gal4 responsive element 



following DNA sequencing. 



Table 101 shows representative 
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upstream of a simple TATA box. 

HepG2 cells were transiently transfected with (Fig. 14A) 
ERof expression vector and reporter 3xERELuc or (Fig. 14B) 
Gal4DBD-ER4, VPlS-ERor and 5xGal4Luc3, and treated with 
5 different ER ligands. Lucif erase activity was normalized to 
the activity of the cotransf ected pCMVjSgal . The ability of ER4 
to interact with ER-of in the presence of different ER-agonists 
paralleled that of ER transactivat ion function as assayed with 
3xERELuc reporter. However, partial agonists or antagonist 
10 inhibited this interaction (Fig. 14C) and (Fig. 14D) . 
Therefore, the LXXLL containing peptides provide a sensitive 
probe for AF2 activation. 

We will now discuss these results in more detail. Shown 
in Figure 14 are comparisons of the ability of different 
15 ligands to activate ERof transcription through a classical ER 
responsive element (ERE) (Figure 14(A)), and their ability to 
facilitate the interaction of the LXXLL peptide (class I-ER4) 
with ER {Figure 14(B)) . 

All steroidal and non-steroidal ER agonists strongly 
20 activated transcription from the 3x- ERE -TATA- Luc reporter 
(Figure 14(A)), while the. s.elective estrogen receptor 
modulators (SERMs) 4 - hydoxy tamoxifen and GW7604, displayed 
minimal agonist activity within this prom.cter context {Figure 
14(C)). The pure antagonist, ICI 182,780, as expected, 
25 functioned as an inverse-agonist that suppressed the 
transcription below the ba^sal, no hormone treatment level 
(Figure 14 (C) ) . 

When analyzing the interaction between the LXXLL motif and 
ERof, we observed a low but significant basal level of 

3 0 interaction in the absence of any ligand treatment, indicating 
that some of the expressed ERa is already in an active 
conformation, allowing the LXXLL peptide to interact. At the 
present time, we do not know whether this basal activity is 
caused by residual estrogens present in the charcoal-stripped 

35 serum, or is due to alternative pathways that activate ER- 
mediated transcription. Above the basal level, however, we 
observed that the interaction of the LXXLL peptide with ERor was 
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entirely ER agonist -dependent . 

The ability of both steroidal and non-steroidal ER- 
agonists to promote ERof/LXXLL- peptide interaction parallels the 
ability of these compounds to activate ERa-mediated 
5 transcription through a classical ER-ERE-mediated pathway. 
This indicates that all of these compounds are mechanistically 
similar, inducing similar conformational changes within ERa, 
and that within target cells these ligand-receptor complexes 
are likely to recruit the same co-activators. 

10 Interestingly, none of the ER-antagonists or SERMs tested 

were able to facilitate ERof-LXXLL interactions. The pure 
antagonist ICI 182,780 totally abolished both basal peptide/ERo: 
J'^ interactions and ERof-mediated transcription {Figure 14 (C) and 

T IS? 

(D)). In addition, although -SERMs such as 4- hydroxy tamoxifen 

..... 

;r 15 and GW7604 can manifest partial agonist activity m certain 

id cell-types and promoter-contexts {Figure 14 (C) and data not 

shown) , in this experiment they actually drove the receptor 
g into a conformation which prohibited LXXLL peptide/ERo: 

H interactions from occurring. As a result, the basal level of 

f3 

m 2 0 interaction between ERo; and peptides containing the LXXLL motif 

tQ was abolished in the presence of these compounds {Figure 14 

(D) ) . 

The crystal structures of raloxifene- , tam.oxif en- , and 
estradiol -activated ERo! HBD have recently been solved and 

25 indicate that the co-activator binding groove within the 
receptor is occupied by a mispositioned helix 12 upon 
antagonist binding. Helix 12 of the receptor thus prevents the 
co-activator LXXLL motif from interacting. Although some of 
our peptides seem to bind strongly to ERof in the presence of 

3 0 estradiol, none of them were able to interact with ERo! in the 
presence of any of the SERMs tested, including 4- 
hydroxytamoxif en, nafoxidine, raloxifene, GW7 6 04, and 
clomiphene. Therefore, the partial agonist activity manifest 
by these compounds in some cells is likely to require cofactors 

35 distinct from those required by estradiol -activated ER. These 
data support the notion that the ability to facilitate the 
interaction of ER with LXXLL-containing co-activators is a 
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fundamental step common to both ligand-dependent and basal 
transcriptional activity mediated by ERof . The observation that 
ER-peptide interactions do not occur in the presence of ER- 
antagonists or mixed agonists may explain why compounds like 
5 tamoxifen and ICI 182,780 can inhibit both basal and ligand- 
dependent activation of ER. 

We also conducted the same analysis using other LXXLL- 
containing peptides and observed similar results. 

Example 101.3 

10 We next examined whether all of the LXXLL- containing 

peptides selected using phage display were functionally 
equivalent. The previously defined ternary structures of the 
H LXXLL motifs co-crystallized with either the ERof or PPARy HBD 

indicated that these motifs bind to a hydrophobic groove 
15 created by helices 3, 4, 5 and require an intact helix 12. 
y Therefore, the ability of the LXXLL motifs identified to 

Y"^' interact with the co-activator binding groove was assessed 

- using a modified mammalian two-hybrid assay. Several ERof' 

H mutants with alterations in the helix 12, as well as the wild 

20 type ERa were produced as VP16- fusion proteins to test their' 
CO ability to recruit LXXLL motifs. 

Selected peptide sequences and different ER mutants (ER- 
LL, ER-3X, ER-535 STOP) were expressed as fusion proteins to 
Gal4DBD and VP16 (TAD) , respectively. In ER-LL, the LL at 539- 

2 5 4 0 are replaced by AA. In ER-3x, we have the mutations 

D538N/E542Q/D545N. In ER-535 stop, the ER is truncated after 
residue 534, i.e., codon 535 is a stop codon. The binding 
affinity of different peptides (ER4, F6 , D47, C33, D22, D48) 
to wild type ER and the three ER mutants were measured by the 
30 expression of 5xGal4Luc3 reporter construct. GRIP-1* and SRC- 
1* constructs contain the center 3 copies of LXXLL motif (a. a. 
629-761 for GRIP-1 and a. a. 622-765 for SRC-1) fused to 
Gal4DBD. The results are shown in Figure 15. 

We found that all of the peptides tested interacted with 

3 5 wild type ERa in a ligand-dependent fashion. As expected, the 

middle three copies of the LXXLL motif (NR-box) found in co- 
activators SRC-1 and GRIP-1 also interacted in a similar 
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fashion. Western analysis showed that different classes of 
peptide-Gal4DBD fusion proteins have different expression 
levels in the cells; therefore, data presented in this assay- 
can only be used to compare their binding patterns, but not 
5 their relative binding affinities (Figure 15 (C) ) . For 
instance, the class II peptides interacted with ERof with 
relatively higher affinity than the Class I and III peptides 
in the in vitro binding assays (data not shown) . The 
expression levels of these peptides, however, are much lower 
10 than the other classes of peptides, which may explain the 
observed lower readout in the mammalian two-hybrid assays. 
Regardless, the mammalian two-hybrid assay remains a useful 
tool to characterize the in vivo interactions between ERo; and 
the peptides. Truncation of the ER helix 12 (ER535 stop) does 
:~ 15 not affect ligand binding or dimerization; however, the ability 

of receptor to interact with any LXXLL peptides was totally 
ly abolished. This was consistent with the observation that helix' 

12 12 is required to form the co-activator binding groove and, 

s more importantly, it implied that all the affinity-selected 

IZ^ 20 LXXLL-containing peptides bind to the same co-activator binding 

fy groove. Furthermore, mutation of a pair of the hydrophobic- 

W residues in helix 12 (L539L540 > A539A540) , significantly 

decreased the ERa transcriptional activity, and these mutations 
also abolished the interaction of ERo; with all of the LXXLL 
25 peptides tested. 

Previously, we and others have demonstrated that 
alteration of the three charged residues in ERo? helix 12 
(D538E542D545 > N53 8Q54 2N54 5 , ER-3x ) abolishes ERof 
transcriptional activity in most cell types (10, 28, 47), and 

30 prevents the interaction of GRIP-1 and SRC-1 type co-activators 
with ERa . Predictably, in our experiments, the interaction of 
the ERq;-3x receptor mutant with GRIP-1 and SRC-1 NR-boxes was 
significantly lower compared to that of the wild type receptor. 
The ability of class I and II peptides to interact with ERof was 

35 also prevented by these specific ERo; helix 12 mutations, 
indicating that they may bind to ERa in a manner which is 
similar to that of the GRIP-1 and SRC-1 LXXLL motifs. In 
contrast, the interactions between class III peptides and ERof 
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was not affected by these mutations. Importantly, the ERQf-3x 
mutant is fully functional in certain cell types, which is 
interesting in light of the observed weak interaction of this 
receptor with co-activators like SRC-1 and GRIP-1. Our 
5 observations suggest, however, that the activity exhibited by 
this mutant receptor might be the result of its interaction 
with cof actors containing class III type LXXLL motifs. 
Regardless, however, it appears that the LXXLL motif is not 
merely a receptor-co-factor docking sequence, but it also 
10 contains information that governs the specificity of these 
interactions . 

Exa.mpl e 101 ,4 

If peptides obtained from phage display are in fact 
mimicking the interactions between ERof and endogenous 
15 cof actors, then they should function in a dominant negative 
manner when co-expressed in cells, disrupting these 
interactions and blocking the ER transcriptional activity. 

HeLa cells were transfected with ERo? expression plasmid 
(RSTVERo?) , 3xERELuc reporter, pCMVjSgal along with different 

20 peptide-Gal4DBD fusion constructs. pM is the Gal4DBD control 
without peptide fusion. Lucif erase activity was measured and 
normalized (Fig. 16) . The ability of different peptides to 
disrupt ERcy transcriptional activity correlates with the 
affinity of these peptides to ERo; as measured in mammalian two- 

25 hybrid assays. Co-expression in HeLa cells of the peptide F6- 
Gal4DBD fusion did indeed decrease the estradiol -induced ER- 
dependent reporter gene expression to approximately 50% of that 
without the peptide. 

Two copies of LXXLL motifs interact synergist ically to 
30 disrupt ER transcriptional activity. 2XF6 : two copies of F6 
peptide was constructed in tandem with a 54 -amino acid spacing 
linker that has the same sequence as in that in between GRIP-1 
NR box II and NR box III. F6G : Gal4DBD-F6 fusion with only one 
copy of F6 peptide plus the linker. Transient transfection was 
3 5 performed in HeLa cells with 3xERELuc, RSTVERo;, pCMV^gal and 
increasing amount of Gal4DBD-pept ide fusion constructs as 
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indicated in the X-axis. 

It was previously suggested that multiple copies of the 
NR boxes in GRIP-1 and SRC-1 can bind to ERof in a synergistic 
manner. The results showed that expression of the center three 
5 copies of the NR boxes from GRIP-1 permitted a more effective 
inhibition of ER-mediated transcription than a single-copy 
peptide. Based on this result, we evaluated the inhibitory 
activity of a construct expressing two copies of the LXXLL 
motif (F6 peptide) on ERof transcriptional activity. The linker 

10 between the two copies was adapted from sequences found between 
the GRIP-1 NR box 2 and NR box 3. When analyzed in target 
cells we determined that fusion proteins (2xF6) containing two 
copies of the F6 peptide were more effective inhibitors of ERo! 
transcriptional activity than those expressing a single copy 

15 of F6 . The 2xF6 was functionally comparable to the construct 
expressing the GRIP-1 NR-boxes which contains three copies of 
the LXXLL motif. The increased efficacy of the 2xF6 as an 
inhibitor of ER function required each of the two LXXLL motifs, 
since addition of the GRIP-1 linker sequence to a single copy 

20 of F6 did not increase its antagonist efficacy. However, we 
have not yet ruled out the possibility that the expression 
levels might be different from one protein to another. 

Example 101.5 

It has been demonstrated by us and others that ER contains 
25 two distinct activation function domains, AF-1 and AF-2, whose 
activities are manifest in a cell-selective manner. In HeLa 
cells, both AF-1 and AF-2 functions are required for maximal 
ER- transcriptional activity, while AF-1 is the dominant 
activation function in HepG2 cells. Our peptide disruption 
30 results closely correlated with these observations. 

LXXLL containing peptides disrupted AF2 function in HepG2 
cells, but did not totally abolish wtER transact ivation 
function in HepG2 cells, where the AF-1 function is dominant 
(Fig. 17) . However, in the same context, the transcriptional 
35 activity of a truncated form of ER (ER179C) . that lacks the AF-1 
domain was diminished- by LXXLL containing peptides. ■HepG2 was 
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transfected with either wtER or ER179C expression plasmids 
along with 3xERELus reporter, Gal4DBD-pept ide fusion 
constructs, and pCMV^Sgal to normalize for transfection 



5 different concentrations of ITjS-estradiol for 16 h before 
assaying . 

In HeLa cells, over-expression of LXXLL- containing 
peptides abolished almost 100% of ER transcriptional activity, 
highlighting the obligate role of AF-2 in ER-mediated function 

10 and that AF-1 is not able to function independently of AF-2 in 
this background. In HepG2 cells, however, we have observed 
that the roles of AF-1 and AF-2 are different. It was 
demonstrated in a previous study that mutations in ER-AF2 that 
block the binding of the co-activators SRC-1 and GRIP-1 with 

15 ER, have no effect on ER- transcript ional activity in HepG2 
cells. We interpreted these data to mean either that 1) in 
this context AF-1 is dominant and AF-2 is not required, or 2) 
in this cell line a co-factor exists whose interaction with ER 
does not require an intact AF-2. In order to discriminate 

20 between these possibilities, we used the LXXLL- containing 
peptides to study the role of AF-1 and AF-2 in ER signaling in 
this background. When either the 2xF6 or GRIP-1 peptides were 
over-expressed in HepG2 cells, they inhibited wtER- 
transcriptional activity; however, not down to the basal 

25 levels. The transcriptional activity was still about 10-fold 
over the basal levels at the highest dose of input peptide- 
fusion plasmid, indicating that some independent AF-1 activity 
is possible in this cell context. This hypothesis is supported 
by the observation that the activity of an ER-mutant lacking 

3 0 AF-1 is inhibited more readily (two- fold over basal level at 
the highest input plasmid dose) by over-expression of either 
of the peptide fusions. The most interesting result, however, 
is that the class III peptide (2xF6) is an efficient inhibitor 
of ER-3x transcriptional activity, whereas the GRIP-1 NR box 

35 peptide is inefficient. Taken together, we observed that the 
class III peptide, F6, interacted with,ER-3x, and that over- 
expression of this peptide inhibited the transcriptional 
activity of this mutant receptor , suggest ing that in HepG2 



efficiency. 



After transfection, cells were induced with 
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cells, a co-factor which contains an F6-like LXXLL motif may 
exist and may be important for ER function. 

Example 101.6 

The interactions between different LXXLL motifs and 
5 different nuclear receptors were assayed in a mammalian two- 
hybrid system (Fig. 18) . Full length receptors and selected 
peptides were expressed as VP16 and Gal4 DBD fusions, 
respectively. The strength of the interactions was measured 
by the activity of the 5xGal4Luc3 reporter gene. NH : no 
10 hormone; H: hormone treatments. Hormones used in this 
experiment: lO""^ M17/3 estradiol for ER-a and ER-jS, lO""^ M 
progesterone for PR-a and PR-b, 10""^ M dexamethasone for GR, 10* 
M9-cis retinoic acid for RAR and RXR, lO""^ M T3 for TR, 10"^ 
M 1, 2 5-dihydroxy Vit.D3 for VDR, and 10"^ M 5a- 
15 dihydrotestosterone for AR. 

ExsLUiple 101.7 

The GRIP-1 and SRC-1 co-activators containing multiple 
LXXLL motifs interacted with most nuclear receptors. 
Alterations of residues surrounding these motifs have been 

20 shown to affect receptor selectivity, thus we next wished to 
define the • sequences within the NR-box which enable it to 
discriminate between receptors using the LXXLL- containing 
peptides identified. For this study, we utilized 

representative members of each class of LXXLL identified from 

25 our focused library along with an LXXLL motif, #293. 

Peptide #293 (ER beta 15e2, sequence SSIKDFPNLISLLSR) was 
affinity selected from phage display of estradiol activated 
ER^ . It contains the LXXLL motif. It showed selective 
interactions with ER/?, TRjS and RARo: but not with other 

3 0 receptors tested as shown in Figure 7. 

A specificity analysis was accomplished by performing 
mammalian two hybrid assays, in which the LXXLL-containing 
peptides were fused to Gal4DBD and the full-length receptors 
were expressed as VP16 fusion proteins. Most steroid receptors 

35 interacted with all three classes of peptides efficiently. The 
lower luciferas_e activity observed with class II peptides is 
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likely related to the lower (-10 fold) expression level of this 



heterodimerization partners, such as RARa, TR(3 , and VDR, 
demonstrated a strong preference for class II over the other 
5 classes of peptides. Interestingly, ERjS also showed the same 
tendency, preferring to interact with class II motifs , 
suggesting that the co-activator binding groove in ERa and ERjS 
may be functionally different. Interestingly, with the 
exception of Dll, the AR interacted only weakly with all the 

10 LXXLL peptides tested, supporting the hypothesis that 
alternative co-activator recruitment methods are used by AR, 
and that the N-terminus is more important than AF-2 in 
recruiting co-activators to the receptor. When the peptide 
#293 was screened against a panel of nuclear receptors, we 

15 found that it showed a strong preference for ER)8, interacted 
weakly with TRjS and RARof, but did not interact significantly 
with the other receptors tested (Figure 7) . Thus, receptor 
specificity can be achieved by. altering sequences flanking the 
core LXXLL motif, and possibly that ERjS-specif ic co-activators 

20 may be found to contain this or a structurally-similar motif. 
To test whether peptide #293 could specifically target ER/8 
transcriptional activity, we over-expressed it as a Gal4-DBD 
fusion protein and assayed its ability to disrupt ERjS - dependent . 
reporter gene expression. Expression of #2 93 had no effect on 

25 ERof mediated gene expression, but the ER(3 transcriptional 
activity was significantly reduced. (Fig. 19) . 

HeLa cells were transfected with either ERof or ER^S 
expression plasmids along with 3xERELuc reporter, pCMVjSgal and 
peptide-DBD fusion constructs as indicated. Cells were treated 

30 with different concentrations of IV^S-estradiol for 16h before 
assaying . 

Similar to the results with ERckf, two copies of the #293 
motif (2x293) disrupted ER^S function more efficiently than a 
single copy peptide . Nevertheless , ERor transcriptional 
35 activity remained unaffected by the expression of 2x293. 
Clearly, not all LXXLL motifs have the same receptor binding 
selectivity. Thus, we believe that receptor-specific LXXLL 
motifs can be „f ound , and be used to target specific co- 



class of peptides (Figure 3 (C) ) . 



Regardless , the RXR 
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factor : receptor interactions . 
Example 101.8 

Interactions between androgen receptor and peptides 
selected in phage display can be monitored in vivo using a 
5 mammalian two hybrid system. In this system, androgen receptor 
is expressed as a carboxy- terminal fusion to a transcriptional 
activation domain derived from the herpes simplex virus VP16 
protein. The peptide of interest (D3 0, Table 101) is expressed 
as a carboxy- terminal fusion to the yeast derived GAL4 DNA 

10 binding domain. The reporter used to monitor pept ide/receptor 
interactions is a luciferase gene downstream of five consensus 
GAL4 binding sites. Luciferase activity is corrected for 
transfection efficiency by including a ^S-galactosidase gene 
driven by a CMV derived promoter as a control . 

15 HUH-7 cells are seeded in media containing 10% fetal 

bovine serum to a density of 50-80% confluency in 24-well 
plates the day before transfection. In all steps, phenol red 
free media containing charcoal dextran stripped fetal bovine 
serum is used as' essentially steroid- free media. Transf ect ions 

20 are carried out in triplicate using lipofectin to deliver a 
total of 3 /xg of DNA per triplicate (50 ng pCMV-bgal, 1500 ng 
5xGAL-luc3, 900 ng pM-peptide, and 500 ng pVP16 -receptor) . The 
transfection media is applied for 6 hours and then removed by 
aspiration. Media is applied to the cells and they are allowed 

25 to recover for 18 hours. Following the recovery period, the 
media is removed and the cells are treated with compound in 
media containing 2% fetal bovine serum for 18-24 hours. The 
cells are washed twice with PBS and then lysed in 65 /xl of 
lysis buffer from Tropix Galacto Light Plus. 

30 Luciferase and /S-galactosidase activities are determined 

by luminescence as described by the vendor (Tropix LucScreen 
and Galacto Light Plus) . 20 jxl samples of clarified cell 
lysate is used in each assay and luminescence determined using 
an LJL Analyst. The luciferase activities are normalized to 

35 the corresponding /3-galactosidase values to generate the 
steroid-dependent peptide/receptor interactions. See Fig. 21. 
The antagonists (flutamide, RU4 86) did not interact. The 
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highest interaction was with an agonist (dihydrotestosterone) . 
Partial agonists (cyproterone acetate, megasterol, 
androsterone) exhibited variable, intermediate interaction. 



Conclusions : 

5 • Peptides with LXXLL motifs have related but different 

activities. Flanking sequences determine: 

1. their affinity for nuclear receptors. 

2 . the requirements for a functional AF2 in ER-o; for 
interaction . 

10 3. their specificity of interaction with different 

nuclear receptors . 

• LXXLL motifs can knock out the estradiol induced ER-a 
transcriptional function in HeLa cells where both API and AF2 
functions are. required for activation. However, in the cell 
15 context where AFl function is dominant (such as HepG2 cells), 
LXXLL motifs cannot totally abolish the estradiol activated 
transcriptional activity. This observation implies two 
possible explanations : 

1. In HepG2 cells, the AFl activity is due to a 
20 different coactivator that contacts primarily the AFl 

region. Therefore, disruption of the interaction between 
SR-Q? and LXXLL-containing cofactors does not disrupt AFl 
function . 

2 . A HepG2 specific cof actor contacts both AFl and AF2 
25 domains. Disruption of the AF2 binding site is not 

sufficient to knock out the interaction of cofactor-ER 
interaction. 

® Peptides with Estrogen Receptor specific LXXLL- 
containing motifs can be obtained by phage display screening 
3 0 and, if active and pharmaceutically acceptable in humans, be 
used as receptor-specific antagonists. 

The identification of ER-associated co-activators and co- 
repressors has helped us understand how different ligands 
acting through the same receptor can manifest different 
35 biological activities. The importance of these proteins in 
mediating ER-pharmacology . was highlighted by our previous 
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studies, which described the identification of different 
classes of peptides whose ability to interact with ER is 
influenced by the nature of the bound ligand. All of these 
interactions represent potential ER-cof actor interactions, and 
5 suggest that ER-pharmacology is more complex than originally 
anticipated. In this study, we have focused on one receptor 
binding motif, LXXLL, and have demonstrated that even within 
this specific core there are multiple classes of functionally 
different LXXLL motifs. Using estradiol-activated ERo;, we 

10 screened 10® variations of the LXXLL motif and identified three 
classes of peptides that interact with the co-activator binding 
pocket within the ERa hormone binding domain. The 
classifications are further substantiated by studies which 
revealed that each class of peptide displayed specific receptor 

15 preferences and that their binding to ERo; was differentially 
affected by ER helix 12 mutations. In spite of their 
differences, the LXXLL-containing peptides all appear to bind 
in an agonist -dependent manner to the same co-activator binding 
groove within ERo; HBD . None of the peptides identified 

20 interact with ER-535stop (helix 12 deletion) or the LL mutant 
(L539L540 > A539A540) . This is not surprising since the co- 
crystal structure of ER with NR box 2 of GRIP-1 shows that 
several residues in helix 12, including L-539, are required to 
make van der Waals contacts between the co-activator groove and 

25 the LXXLL peptide. It is likely that truncation of the helix 
12 or mutations of the paired hydrophobic residues de-stabilize 
such interactions. Furthermore, replacing the three charged 
residues in helix 12 with their corresponding amides (ER-3x) 
disrupts the ability of class I and class II peptides to 

3 0 interact with ER . The ternary structure predicted from the co- 
crystal structure suggests that the conserved glutamic acid 
(E542) in ER helix 12, along with the lysine residue (K362) in 
helix 3, cap the LXXLL peptide in the co-activator binding 
groove through hydrogen bonding to the backbone amides or 

35 carbonyls of the residues on the N- or C-terminal turns of the 
peptide helix. Although the charged side chain is not directly 
involved in the hydrogen bonding, the positively-charged 
residue preceding the LXXLL motifs is thought to be important 
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for orienting and positioning these motifs within the co- 
activator binding groove which is capped on one end by the 



^ results showed that changing the Glu-542 into Gln-542, which 
5 neutralizes the charge but still preserves the hydrogen 
bonding, greatly reduced the ability of this mutant receptor 
to interact with class I and class II peptides. One of the 
most surprising findings of our study, however, is that the 
class III peptides, which do not contain any positively-charged 

10 residues immediately preceding the LXXLL motif, interact 
strongly with both wild type ER and the ER-3x mutant, 
supporting the hypothesis that this class of peptides binds in 
a unique manner to the ER AF-2, and that the "charged clamp" 
model may not hold for all LXXLL interactions. 

15 Because of the unique properties of the class III LXXLL, 

we searched the sequences of known nuclear receptor- interact ing 
motifs for analogous sequences. Interestingly, class Ill-like 
LXXLL motifs were found to be present in multiple copies in 
RIP14 0, where the LXXLL motifs are preceded by a 

20 serine/threonine and an isoleucine/leucine . Importantly, RIP140 
was shown to interact with the ER-3x, whereas GRIP-1 or SRC~1 
did not, suggesting that the class III peptides represent a 
biologically relevant LXXLL motif. Similar types of motifs 
were also found in orphan receptors DAX- 1 and SHP, two 

25 receptors that are able to interact with estradiol-activated 
ER and disrupt its ability to activate transcription. Although 
the domains within DAX-l/SHP-1 responsible for these 
interactions have not been precisely determined, based on their 
interaction patterns (induced by estradiol, inhibited by 

30 tamoxifen, and insensitive to ER 3x mutations) , we anticipate 
that these interactions, are mediated, at least in part, through 
LXXLL-like motifs. Since both RIP140 and SHP can disrupt wild 
type- as well as ER-3x mutant -mediated transact ivat ion, it is 
tantalizing to speculate that class III type motifs might be 

35 utilized by ER- inhibitors instead of ER-co-activators . We were 
able to show, however, that the F6 peptide (class III) can 
compete with endogenous cof actors and suppress estradiol - 
induced ER activation in target cells. This leaves open the 



negatively-charged E-542 . 



Consistent with this idea, our 
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possibility that another class of receptor co-activators that 
use the class Ill-like LXXLL motif may yet be found. Clearly, 
not all LXXLL motifs are the same. However, until each of 
these motifs is found within a bona fide ER-regulator , the 
5 functional significance of these different peptides cannot be 
determined. Regardless, our study highlights a complexity in 
ER-action heretofore unanticipated . 

All of the AF-2 interacting co-activators that have been 
found contain an LXXLL motif. Thus, given the homology in the 

10 AF-2 domain among receptors and the simplicity of the LXXLL 
motif, it was difficult to understand how receptor specificity 
could occur. Interestingly, with the collection of peptides 
we obtained, we were able to demonstrate that the ERof and ERjS, 
two highly homologous receptors with similar ligand binding 

15 characteristics, showed distinct preferences for different 
classes of peptides. Previously, we found that the ERjS 
homodimer is a weaker transcriptional activator than the ERo: 
homodimer and the ERa/jS heterodimer . It would be interesting 
to see if the differences in their transcriptional activity are 

20 due to their differential association with different cof actcrs . 
Although ERof and ER(3 have overlapping affinities for their 
ligands and DNA responsive elements, they are not functionally 
redundant. Their ability to interact differentially with 
different LXXLL motifs within co-activators might explain how 

25 ERcv and ERjS manifest different transcriptional activities in 
target cells. 

The PPARy-binding protein (PBP) and its human homolog 
TRAP220 (also called DRIP205) contain LXXLL motifs that also 
have a proline at the -2 position, similar to the class II 

30 peptides. These cof actors we're identified originally by their 
ability to interact with PPARy, TR and VDR in vivo, and were 
shown to interact with RAR and RXR with high affinity in vitro. 
A remarkably similar pattern was observed in our study when we 
demonstrated by mammalian two-hybrid analysis that TR, VDR, RAR 

35 and ER/3 appeared to have stronger preference for the class II 
peptides, suggesting that the occurrence of a proline at the - 
2 position might favor these interactions. Based upon alanine 
scanning studies, Mclnerney. et al . suggested that receptor 
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recognition is most likely contributed by residues C-terminus 
to the LXXLL motifs. In our study, however, we did not find 
a good consensus in the C-terminus in over 50 peptides selected 
from both random and focused library screening, using either 
5 ERoi or ERjS as targets. In contrast, residues at the -2/-1 
positions are dominated by either S/ (R or K) or S/(I or L), 
which suggests that residues in these positions are important 
for cofactor/ER interaction through the LXXLL motif, and that 
these sequences are generally accepted by steroid hormone 
10 receptors. Moverover, certain receptors such as TR, VDR, RXR 
and ERjS appear to favor motifs with a proline at the -2 
position, again highlighting the importance of this residue for 
f1 receptor/cof actor recognition. However, we cannot rule out the 

possibility that the differences observed may reflect a 
ry 15 selection bias as we have used only ER as a target for affinity 

}^ selection. We would also like to emphasize that although 

[J residues occupying the -1 and -2 positions seem to be a 

critical determinant of LXXLL specificity, sequences outside 
of these regions are also important, since different receptor 
£3 20 binding specificity has also been observed within the same 

1? class of peptides. For example, the ER^S-specif ic #293 peptide 

Q may be considered a class II member, because it also contains 

■ a proline at the -2 position. Clearly, however, sequences in 

addition to the proline at -2 are important since #2 93 has a 
25 unique receptor selectivity. 

The identification of novel classes of LXXLL motifs and 
the finding that they interact with ER in different ways have 
highlighted the complexity of ER-action. As yet, given the 
limited number of co-activators and co-repressors available for 
30 analysis, it is not possible to evaluate the full significance 
of our findings. However, we believe that these studies 
provide a glimpse of what is yet to come. In addition to the 
mechanistic insight offered by these studies, they have also 
provided some novel technology which may be used in drug 
35 discovery. Some investigators have utilized the co-activator 
receptor ligand assay (CARLA) as a way of screening for 
compounds which function as receptor agonists and allow the 
formation of an AF-2/co-activator groove. The CARLA assay uses 
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the known coactivator of a nuclear receptor and the nuclear 
receptor itself to screen for new agonists. The theory behind 
this is that the coactivator can only bind in the presence of 
agonist so if you see coactivator binding it indicates agonist 
5 binding- For known receptors, where the co- factor interactions 
have been well established, this is likely to be. useful. 
However, when studying an orphan receptor for which no ligand 
has been identified, its success relies on whether the receptor 
can interact with the co-activator chosen. For this purpose, 

10 a "universal" co-activator is desirable. Our studies have 
illustrated that several different LXXLL motifs interact 
differentially with different receptors. Therefore, the use 
of a single peptide in a screening paradigm can be risky, but 
the chance of success will be increased by incorporating 

15 several different classes of peptides in the screen. 
Thus , we propose 

(1) identifying a plurality of different BioKey (e.g., 
peptide) co-activators for receptors with known 
agonists and co-activators by screening for the 
20 ability of the BioKey to activate the receptor in 

the presence of the agonist and absence of the 
coactivator (at least to it activate more strongly 
than the extent made possible by the endogenous 
coactivator) , and 
25 (2) using a plurality of the identified co-activators 

simultaneously as putative co-activators in the 
screening of substances to identity an agonist of an 
orphan receptor, especially one suspected of 
recjuiring a co-activator . 
30 This is particularly of interest when the receptor is a 

nuclear receptor, in which case peptides with LXXLL motifs are 
of particular interest . 

More particularly, our assays could work in the following 
manner. For an in vitro assay we would use the standard TRF 
35 assay. The receptor would be attached by biotinylation, via 
a response element , or directly to the plastic . The 
coactivator peptides would then be biotinylated and attached 
to Europium- St rept avidin . If the compounds being tested put 
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the receptor in an active conformation then the peptide would 
bind. This would generate a signal. 

The in vivo assay could have the receptor fused to VP16 
and the "coactivator " peptides linked to Gal4DBD in a two 
5 hybrid system in a cell. The reporter constructs would be 
transfected in and then the compounds added. One would then 
look for a binding of the coactivator peptide in the presence 
of the test compounds. A signal would indicate that an agonist 
had bound the receptor and put it in a conformation that 
10 allowed the coactivator to bind. 

Another application of these peptides, validated in our 
study, is their use as peptide antagonists of receptor 
p function. For instance, the peptide #2 93 when introduced into 

cells has been shown to specifically inhibit ERjS-mediated 
nj 15 responses to estrogen. Since a specific small molecule 

inhibitor of ERjS has not been identified, we believe that the 
#2 93 peptide may allow us to unravel some of the biology of 
H this receptor. We believe that the technology utilized in our 

studies will also be useful for the study of orphan receptors. 

5 

C3 20 Specifically, we suggest that the identification of peptides, 

Vf which bind specifically to an orphan receptor, and which 

,3 inhibit its transcriptional activity, can be used as ''peptide 

^3 antagonists" to study the biology of the receptor without 

knowing its ligands. 
25 The results presented in this study confirm that the co- 

activator LXXLL motif is necessary and sufficient for receptor 
interaction. In addition, they revealed the importance of 
sequences surrounding the LXXLL core in determining receptor 
selectivity and in defining the manner in which co-activators 
30 interact with the nuclear receptors. The complexity 

highlighted by these studies suggests that the currently 
available co-activators and co-repressors represent only a 
fraction of those which will ultimately be found and shown to 
interact with the nuclear receptors. 
35 The utilities contemplated by applicants for the peptides 

disclosed in this set of Examples include, but are not limited 
to, 

1. Using the peptides in a panel to screen nuclear 
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receptors. These peptides specifically mimic 

different types of coactivators . Using the panel 
when screening will increase the chance of getting 
the receptors in the conformation needed to find a 
5 ligand with the desired activity. 

2. Using class I (SRLXXLL consensus sequence within the 
peptide) to screen nuclear receptors for the reasons 
described above . 

3. Using class II (with the consensus sequence P, 
10 hydrophobic, LXXLL somewhere within the peptide) to 

screen nuclear receptors for the reasons described 
above . 

4. Using class III (with the consensus sequence S/T, 
hydrophobic, LXXLL somewhere within the peptide) 

15 peptides to screen nuclear receptors for the reasons 

described above . 

5. Using #293 (jS specific peptide) to screen for 
ligands specific to ER|S . 

6. Using class II peptides (with the consensus sequence' 
20 P/hydrophobic/LXXLL somewhere within the peptide) to 

look for nuclear receptors that heterodimerize with 
the nuclear receptor RXR . 

7. Using D30 (class I) to differentiate between 
agonists , antagonists , and partial agonists , 

25 especially of the androgen receptor. 
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Example 201 Application of the Technology to G-Protein-Coupled 
Receptors and G a Subunits 

The in vitro drug identification system described above 
can also be extended to other classes of biological signal 
modulating proteins such as the serpentine receptors (also 
known as the G protein coupled receptors (GPCR) or seven 
transmembrane spanning receptors) and their cognate G proteins. 

Signals initiated by a variety of mammalian hormones and 
neurotransmitters are received by seven transmembrane domain 
receptors in the plasma membrane of cells and are transduced 
to intracellular effectors via heterotrimeric G proteins. 

Many different G proteins are known to interact with 
receptors. G protein signaling systems include three 

components: the receptor itself, a GTP-binding protein (G 
protein), and an intracellular target, which is usually a 
protein . 

The cell membrane acts as a switchboard. Messages- 
arriving through different receptors can produce a single 
effect if the receptors act on the same type of G protein. On 
the other hand, signals activating a single receptor can 
produce more than one effect if the receptor acts on different 
kinds of G proteins, or if the G proteins can act on different 
effectors . 

The heterotrimeric G protein is composed of a guanine 
nucleotide-binding a subunit together with a tight complex of 
/? and y subunits. As of 1997, 23 distinct alpha subunits were 
known. (Gudermann) . These have been divided into subfamilies 
Gs, Gi(Gil, Gi2, Gi3 , GoA,B, Gtl,2, Gg, Gz) , Gq (Gq, Gil, G16, 
G14, G15) , and G12(G12, G13). See Dhanasekaran, Table 1. In 
their resting state, the G proteins, which consist of alpha 
(of) , beta ()8) and gamma iy) subunits, are complexed with the 
nucleotide guanosine diphosphate (GDP) and are in contact with 
receptors. When a hormone or other first messenger binds to 
receptor, the receptor changes conformation and this alters its 
interaction with the G protein. This spurs the a subunit to 
release GDP, and the more abundant nucleotide guanosine tri- 
phosphate (GTP) , replaces it, activating the G protein. The 
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G protein then dissociates to separate the a. subunit from the 
still complexed beta and gamma subunits. The free Gof and the 
GI3y subunits both may be capable of influencing the activity 
of specific effector molecules (e.g., the enzymes adenylyl 
5 cyclase, cyclic GMP phosphodiesterase (PDE) , phospholipase C, 
phospholipase A2, and selected ion channels) . The effector 
(which is often an enzyme) in turn converts an inactive 
precursor molecule into an active "second messenger, " which may 
diffuse through the ' cytoplasm, triggering a metabolic cascade, 

10 After a few seconds, G protein signalling is terminated with 
the hydrolysis of GTP to GDP through the intrinsic GTPase 
activity of the Gof. subunit and the subsequent reassociation of 
Gof-GDP with G^y to form the inactive heterotrimer . This 
reassociation is driven by the high affinity of GDP-bound Go? 

15 for G(3y . 

Hundreds, if not thousands, of receptors convey messages 
through heterotrimeric G proteins, of which at least 17 
distinct forms have been isolated. Although the greatest 
variability has been seen in the ot subunit, several different 
20 i8 and 7 structures have been reported. 

Signalling through GPCRs has been reconstituted in yeast, 
see King, et al . , Science, 250: 121-3 (1990). Himmler, et al . , 
J. Receptor Res., 13: 79-94 (1993) transformed CHO cells with 
a reporter plasmid containing the firefly luciferase gene under 
25 the control of multiple cAMP responsive elements. Reporter 
cell lines were transfected with human dopamine Dl and D5 
receptor genes. These are GPCR genes; the GPCR couples to the 
cAMP signal transduction pathway. 

Conklin, et al . , Nature, 363: 274 (1993) has shown that 
30 three a. a. substitutions are sufficient to convert the receptor 
specificity of Galpha-q to that of Galpha-i. 

GPCRs have been extensively exploited as targets for drug 
discovery in many therapeutic areas such as gastrointestinal, 
cardiovascular and neurological diseases. The ability to 
35 rapidly and inexpensively identify drugs that activate or block 
GPCRs would be of great utility to the pharmaceutical industry. 

By detecting GTP: GDP exchange, one may conduct a membrane 
(as opposed to intact cell) based assay for agonists and 
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antagonists of GPCRs . Some GPCRs interact with members of the 
Gi family of G proteins, which have relatively high exchange 
rates. To facilitate study of GPCRs which interact with other 
G proteins, such as Gs or Gq, chimeric G-alpha proteins have 
5 been constructed which have a Galpha-i backbone. 

Another screening strategy involves detection of cAMP, 
directly or indirectly (activation of a cAMP reporter system) . 
If a GPCR does not associate with a Gs protein, it may be 
advantageous to construct a chimera of the appropriate G-alpha 
10 subunit with the Gas backbone. 

The "universal" G proteins G15 and G16 are activated by 
a very wide range of GPCRs and hence are quite, useful to 
screening orphan receptors . 

All GPCRs have at least two functional domains. One is 
15 the ligand binding domain on the external surface and the other 
is the G protein binding domain that is on the intracellular 
surface . 

In their quiescent state the G proteins that are activated 
by GPCRs exist as G protein {oi(3y) heterotrimers containing 

20 guanine diphosphate (GDP) bound to Ga subunits. GPCRs activate 
their cognate G proteins by acting as guanine nucleotide 
exchange factors (GEFs) . Upon GPCR activation, free GTP 
replaces GDP bound to the ot subunit of the G protein. The GTP- 
bound Go: subunit and G^y then dissociate and regulate the 

25 function of second messenger enzymes and ion channels. Before 
GPCRs can activate G proteins, they must be switched from an 
inactive to an active state by the action of the appropriate 
ligand. GPCRs have little or no detectable affinity for their 
cognate G proteins until activated. Chemicals that mimic the 

3 0 action of GPCR ligands are known as agonists and induce a 
change in the GPCR such that it acquires selective affinity for 
its cognate G protein. Chemicals that block the action of the 
GPCR ligands are known as antagonists and prevent the induction 
of structural changes necessary for the GPCR to bind to the 

35 cognate G protein. 

Chadhi, et al . , Fundam. Clin. Pharmacol., 12:121-132 
(1998) , Table 1 sets forth the sequences of a number of 
peptides which activate G proteins. Several contain the 
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positively charged RPK tripeptide. 

Mochizuki, et al . , Gene, 181: 39-43 (1996) used a yeast 
two-hybrid system to identify proteins which interact with the 
ot subunit of G12 . Full length G12 was fused to the DNA binding 
5 domain of GAL4 as bait, and proteins from a human B cell cDNA 
library were fused to the complementary activation domain. The 
authors identified clones which grew in the absence of 
histidine and which could activate the lacZ reporter gene. One 
clone encoded the C-terminal of a previously unrecognized 
10 protein; the full length cDNA for this protein was isolated. 
The protein was termed LGN because of its richness in LGN 
repeats, 

Rasenick, et al . , J. Biol. Chem. , 269: 21519-21525 (1994) 
tested the ability of Gsof-derived peptides (15-29, 354-372, 
15 384-394) and Gioi-derived peptides (8-22, 315-324, 345-455) to 
interfere with coupling between the )S-adrenergic receptor and 
Gs . 

BioKeys (peptides and similar molecules) that probe GPCRs 
can be directed against either the GPCRs themselves or their 

20 cognate Got subunits. The Go? subunits can indicate GPCR 
activation because when GPCRs are activated GTP is bound to the 
Got subunit, and when they are inactive GDP is bound. For GPCRs 
these BioKeys will specifically recognize two basic functional 
domains, the ligand binding site and the activated receptor via 

25 the G protein-binding site. They may also bind the 

intracellular or extracellular region of the GPCR, independent 
of activation site. In the case of Got subunits the BioKeys 
will specifically recognize the GDP or GTP bound forms, or the 
Got subunit independent of bound GDP or GTP. Thus, six classes 

3 0 of BioKeys can be identified, three for each Got subunit and 
three for each GPCR. Such Biokeys can be of immense value for 
the identification of new therapeutic agents (drugs) using in 
vitro screening methods. 

At present, drugs that act on GPCRs are generally 

35 identified in either of two ways. One (the cell based assay) 
is very cumbersome and expensive to carry out . The other 
(ligand displacement assay) is the use of labeled ligands to 
determine the ability of a test substance to compete for the 
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binding of the ligand to the GPCR. While the latter is 
substantially less expensive and more convenient to carry out, 
it is not possible to distinguish agonists from antagonists and 
thus the usefulness of such an assay is limited. 
5 Through the use of BioKeys specific to each of the basic 

functional domain of GPCRs, we can carry out simple, 
inexpensive in vitro screens for both agonists and antagonists 
to GPCRs and can distinguish the complete range of activities 
for such compounds from pure agonists, to partial agonists to 
10 complete antagonists. Furthermore, we can identify SGM's which 
would be extremely difficult and expensive to do using prior 
technologies . 

Example A: Screen for agonists and antagonists to the 
beta- two adrenergic receptor (BAR) . BAR is activated by 

15 ligands such as epinephrine and isoproterenol. These ligands 
are agonists and are useful for the treatment of diseases such 
as asthma and severe allergic reactions. Antagonists to AR are 
useful for regulating cardiac function and the treatment of 
hypertension and cardiac arrhymias . 

20 Class I BioKeys to BAR are identified by affinity 

selection of high affinity (better than 50 micromolar) peptides 
from BioKey libraries. BAR can be produced using recombinant 
techniques known to those in the art using systems such as the 
baculovirus expression system in insect cells. BAR containing 

25 membranes or purified BAR can be used for the affinity 
selection of BioKeys to the ligand binding site. Selectivity 
for that site can be confirmed by the displacement of the 
BioKey by the agonist isoproterenol. 

In a similar manner. Class II BioKeys can be identified 

3 0 to the G protein binding domain of activated BAR. First, BAR 
is pretreated with an excess of an agonist e.g. isoproterenol 
to induce the BAR into an "active" conformation. BioKeys are 
selected as described herein and selectivity and specificity 
is confirmed by their ability to bind to agonist treated but 

3 5 not to untreated BAR or antagonist treated BAR. 

Class III BioKeys can be identified to the GDP-bound form 
of GsQf subunits. Purified Gsa subunits are produced and 
purified using recombinant techniques and expression in 
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bacterial cells or in baculovirus -modified insect cells. 
Purified Gsa subunits are pretreated with an excess of GDP to 
induce the "inactive" conformation or GDP-bound form of Gsof 



5 selectivity and specificity is confirmed by their ability to 
bind to GDP treated, but not to GTP treated Gsa subunits. 

In a similar manner, Class IV BioKeys can be identified 
to the GTP-bound form of Gsof subunits. Purified Gsc^ subunits 
are pretreated with an excess of GTP to induce the "active" 
10 conformation or GTP-bound form of Gso: (GTP-Gsof) . BioKeys are 
selected as described herein and selectivity and specificity 
is confirmed by their ability to bind to GTP treated, but not 
to GDP treated Gso; subunits. 



15 of Gof. 

Class VI Biokeys bind to either the intracellular or 
extracellular regions of the BAR (GPCR) independent of 
activation state. 

Representative BioKeys to all six classes, or to a subset 

20 thereof, can be labeled with a suitable moiety as described 
elsewhere herein such as europium labeled streptavidin and used 
in a drug screen using fluorescence measuring devices. Many 
other means of using BioKeys as surrogate ligands will be 
apparent to those skilled in the art of drug compound 

25 screening . 

As can be seen from Table 5, it is very easy to 
distinguish three classes of compounds from a chemical compound 
collection. Inactive compounds do not bind to relevant 
functional domains on the AR and thus no change in signals from 

3 0 either BioKey is seen. For GPCRs, agonists are easily 
identified due to their ability to induce the AR into a 
conformational state such that the G protein binding domain is 
capable of binding class II BioKeys that are surrogates for the 
AR' s cognate G protein. Agonists that bind to the ligand 

35 binding site will also lead to a measurable decrease in class 
I BioKey binding. Antagonists are identified by their ability 
to bind to the ligand binding domain and hence are capable of 
blocking the natural ligand from binding. However, unlike 



(GDP-Gsq:) . 



BioKeys are selected as described herein and 



Class V Biokeys bind both the GDP and the GTP bound forms 



# % 

190 

agonists, they have no ability to induce the activation of the 
receptor; hence there is no change in the conformation of the 
AR' s G protein binding domain and thus no change in ability to 
bind class II BioKeys . 
5 Alternatively, screening for compound agonists or 

antagonists can be performed using AR-containing membranes, Gs 
and BioKeys specific for GTP-Gso! or GDP-Gsof. Agonists will 
activate the receptor and result in the formation of GTP-Gsor. 
The signal from BioKeys specific to GDP-Gsof will decrease if 

10 an agonist is binding the receptor. Likewise, the signal from 
BioKeys specific to GTP-Gso? will increase if an agonist is 
bound to AR. Antagonists are not capable of activating the 
receptor and therefore are not able to activate G proteins. 
The G protein will then remain a heterotrimer containing GDP 

15 bound to its Gsor subunit . Screen for antagonist using AR- 
containing membranes pretreated with an agonist. The signal 
from BioKeys specific to GTP-Gsa will decrease if an antagonist 
is binding the receptor. Likewise, the signal from BioKeys 
specific to GDP-Gso; will increase if an antagonist is bound to 

20 AR. BioKeys that bind Gso? in both its GDP and GTP-bound forms, 
will bind to Gso! in the presence of either an agonist or 
antagonist. This class of BioKeys can be used as a control for 
G protein functionality/stability. 

This system can be readily extended to other GPCRs for 

25 which one has access to a natural ligand or an agonist. 

If desired, the assay may employ a GTP or GDP analogue in 
place of GTP or GDP. The analogue must be able to fulfill the 
functional role of GTP or GDP, respectively, in the GPCR/G 
protein signal transduction system. Either the analogue must 

30 be recognized by the BioKey peptide specific for the wild-type 
GTP or GDP, as appropriate, or a complementary BioKey specific 
for the analogue must be developed and used. 
A list of GTP analogues follows: 

Guanosine 5' -O- (3 - thiotriphosphate) /GTP 
3 5 gamma S 

Guanyl-5' -ylimidodiphosphate (GMPPNP) 

Guanyl - 5 ' -ylmethylenediphosphate 
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Guanosine 5' - [beta,gamma- 

imido] triphosphate 

Guanosine 5' - [gamma- thio] triphosphate 

2 ' , 3 ' -dideoxyguanosine 5 ' triphosphate 
5 (ddGTP) 

2' ,3' -dialdehyde GTP (oGTP) 

gaunyl - 5 - yl imidodiphospha t e ( GppNHp ) 

cytidylyl (5 ' -3 ') guanosine 5 ' -triphosphate 
(pppGpc) 

10 There are parallel GDP analogues, too. 
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Example 202: Discovery of GDP - dependent , GTP-dependent and 
GDP/GTP- independent Peptides Specific for Gioil using Phage 
Affinity Selection and Phage Display Peptide Libraries 

Phage display was performed on biotinylated Giofl in buffer 
5 A in the presence or absence of GDP or GTP7S . The deduced 
amino acid sequences of Gial-binding phage peptides formed 
several different sequence clusters (a group of peptides with 
amino acid sequence similarity) . 

All phage were isolated by affinity selection on 

10 biotinylated Giofl and subjected to DNA sequence analysis were 
examined for Giofl specificity against a panel of other 
biotinylated proteins (alkaline phosphatase, glucose-6- 
phosphate, jS-galactosidase , hexokinase) . The phage were found 
to be specific for Giofl protein. 

15 Gickfl protein amounts and phage volumes were titrated to 

determine where the optimal GDP/GTP or GTP/GDP differential 
binding occurs for the phage. It was determined that one pmole 
of Gial and 5 ^1 of phage in an ELISA yielded the optimal 
differential binding of phage to Giofl . All phage containing 

20 peptides of known sequence were categorized into GDP - dependent , 
GTP-dependent or GDP/GTP- independent Gial binders. 

GDP/GTP -dependent phage differentiation assays were also 
performed with G±al in the presence of GDP and AIF4", a complex 
that mimics the transition state for GTP hydrolysis of the Got 

25 subunit . AIF4" is an activator of GDP-bound Go? subunits that is 
believed to substitute for and mimic the y-phosphate of GTP. 

Only GTP- specific phage were found to bind in presence of 
A1F4- and GDP. 

GTP-dependent peptides (T-peptides) , GDP-dependent 
3 0 peptides (D-peptides) and GDP/GTP- independent peptides (I- 
peptides) were synthesized, and biotinylated or f luorescently 
labeled . 

Materials and Methods 
Materials, GDP and GTPyS were purchased from Sigma. Immulon 
35 4 plates were purchased from Dynatech. Purified Giofl was 
kindly provided by Merck. Sequencing of single- strand M13 DNA 
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25 



30 



was conducted by Sequetech Corporation (Mountain View, CA) 



identified using the' following procedures. Phage affinity- 
selection of biotinylated Giofl was performed in the presence 
of GDP, GTP7S or buffer A alone, essentially as previously 
described (Sparks, A.B., Adey, N.B., Cwirla, S. & Kay, B.K. 
(1996) in Phage Display of Peptides and Proteins, A Laboratory 
Manual, eds . Kay, B.K., Winter, J. & McCafferty, J. (Academic, 
San Diego) , pp. 227-253) . The nonhydrolyzable form of GTP, 
GTPys, was used in all experiments to prevent hydrolysis of GTP 
to GDP. Briefly, Immulon 4 plates were coated with 

streptavidin in 0 . 1 M sodium bicarbonate followed by blocking 
with 1.0% BSA in 0 . IM sodium bicarbonate. The plates were then 
incubated for Ih at room temperature (RT) with 10 pmoles of 
purified biotinylated Gioflprotein in Buffer A (20 mM Hepes, pH 
7.5 buffer containing 1 mM EDTA, 16 mM MgCls, 1 mM DTT and 
0.05% Tween 20) alone or Buffer containing either 5 /xM GDP or 
5 ixM GTPyS . Streptavidin binding sites not containing a bound 
Giofl protein were blocked with a Buffer A solution containing 
0 . 5 mM biotin. Twenty different phage libraries were tested, 
one library per well. After incubation of the phage libraries 
with the immobilized protein for 3h at RT, nonspecif ically 
bound phage were removed by washing with TBST buffer (10 mM 
Tris-HCl, pH 8.0, 150 mM NaCl, 0.05% Tween 20) containing 0.5 
mM biotin. Phage that bound specifically to the Gial protein 
were then eluted sequentially with a low pH glycine buffer and 
a high pH ethanolamine buffer. The pH of the eluted phage was 
then neutralized and the phage were amplified and repanned 
using iterative rounds of the affinity selection procedure. 
Each phage library was panned until the amplified pool of phage 
bound to immobilized Giorl significantly above no protein 
controls as judged by phage ELISA. At this point, phage were 
plated out and individual phage were picked and examined by 
ELISA for specific binding to Gial and a panel of other 
unrelated biotinylated proteins. Individual phage displaying 
peptides specifically binding to Giofl were then plaque purified 
and DNA was isolated from .these ^phage . The DNA sequence of the 



Phage Affinity Selection. 



Giofl specific peptides were 
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segment encoding the peptide of interest was then determined 
along with the deduced amino acid sequence of the peptide. 



Phage ELISA. Giofl was immobilized in the presence of either 
GDP or GTPyS as described for phage affinity selection. Phage 
5 (50 ^1) from either a 5h or overnight culture grown in DHSofF' 
cells was added directly to wells containing immobilized Gial 
and incubated for 4 5 min at RT. Unbound phage were removed by 
TEST washes and bound phage were detected using an anti-M13 
antibody conjugated to horse radish peroxidase. Assays were 
10 developed for 10 min at RT by adding 2 , 2-azinobis (3- 
ethylbenzothiazoline) -6 sulfonic acid and hydrogen peroxide. 
The development was stopped by adding SDS to a final 
concentration of 1%. Absorbance was measured at 405 nm in a 
Molecular Devices microplate reader. 

15 GDP/GTP -Dependent Phage Differentiation , Phage were 

categorized into GDP-dependent , GTP-dependent or GDP/GTP- 
independent Giojl binders as described for Phage ELISAs with the 
following modifications. Giofl was incubated overnight in 
buffer A containing either 100 /xM GDP or GTP7S, and then one 

20 pmol of this protein was immobilized on plates as previously 
described. Phage (5 /il) was added to the wells in the presence 
of buffer A containing either 100 fiM GDP or GTPyS and incubated 
for 3 0 min at RT. Phage was also added to no protein control 
wells in the presence of Buffer A. 

25 Results 

The results are shown in Tables 202A-202C, below. 
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Example 203: In Vitro Assays Using GDP - dependent , GTP-dependent 
and/or GDP /OTP -independent Peptides for the Detection of 
Agonists and Antagonists of G-Protein Coupled Receptors 



Definitions ; 

5 D-peptide: A peptide that binds preferentially to the GDP/Gckf 
protein complex compared to the GTP/Gof or the GTPyS/G<y complex. 

T-peptide: A peptide that binds preferentially to the GTP/Go? 
or the GTPyS/Gq; protein complex compared to the GDP/Go; complex. 

I-peptide: A peptide that binds to Go; independent of the 
10 activation state. 

Type 1: Scintillation Proximity Assay (SPA) 

Protocol 1.1. Detection of agonist binding using a T-peptide 

1. Membranes containing the GPCR and G protein to be 
assayed are attached to the surface of a scintillant plate, 

15 such as a FlashPlate (NEN) , or to scintillant beads, such as 
SPA-beads (Amersham) . 

2. Activation of the GPCR by an agonist results in 
activation of the heterotrimeric G protein to dissociate the 
Go; subunit from the GjS and Gy subunits and conversion to tihe 

20 activated Gce-GTP conformation. 

3 . A radiolabeled T-peptide can then bind to the activated 
Gof/GTP or Ga/GTPyS complex. 

4. The binding of the T-peptide results in production of 
light by the scintillant that is in close proximity to the 
25 membrane/Gof/T-peptide complex. This light is detected by a 
scintillation counter . 

Protocol 1.2. Detection of an antagonist in the presence of 
an agonist using a T-peptide 

1. Membranes containing the GPCR and G protein to be assayed 
30 are attached to the surface of a scintillant plate, such as a 
FlashPlate (NEN) , or to scintillant beads, such as SPA-beads 
(Amersham) . ... .. _ . , 
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2 . An agonist is added at subsaturating levels to the assay- 
to activate the GPCR, when no antagonist is present, resulting 
in activation of the heterotrimeric G protein to dissociate the 
Go; subunit from the GjS and Gy subunits and conversion to the 
5 activated Gof-GTP conformation. 

3 . A radiolabeled T-peptide can then bind to the activated 
Gckf/GTP or Gof/GTPYS complex. 

4 . Compounds to be assayed are incubated to allow an 
antagonist to bind to the GPCR. 

10 5. The binding of the T-peptide results in production of 

light by the scintillant that is in close proximity to the 
membrane/Gof/T-peptide complex. This light is detected by a 
scintillation counter. An antagonist is detected by a 
reduction in the scintillation signal . 

15 Protocol 1.3. Detection of agonist binding using a D-peptide 
1. Membranes containing the GPCR to be assayed are 
attached to the surface of a scintillant plate, such as a 
FlashPlate (NEN) , or to scintillant beads, such as SPA-beads 
(Amersham) . 

20 2. A radiolabeled D-peptide is bound to the inactive 

heterotrimeric G protein/GDP complex. 

3 . Activation of the GPCR by an agonist results in 
activation of the heterotrimeric G protein to dissociate the 
Got subunit from the Gj8 and Gy subunits and conversion to the 

25 activated Gof-GTP conformation and the D-peptide is released 
from the activated Gof/GTP or Ga/GTPyS complex. 

4 . The presence of an agonist is detected by a decrease 
in the production of light by the scintillant that is in close 
proximity to the membrane/heterotrimeric G protein/GDP/D- 

30 peptide complex by a scintillation counting. 

Protocol 1.4. Detection of an antagonist in the presence of 
an agonist using a D-peptide 

1. Membranes containing the GPCR to be assayed are 
attached to the surface of a scintillant plate, such as a 
35 FlashPlate (NEN) , or to scintillant beads, such as SPA-beads 
(Amersham) 



4 % 

197 

2. A radiolabeled D-peptide is bound to the inactive 
heterotrimeric G protein/GDP complex. 

3. An agonist is added to the assay to activate the GPCR, 
when no antagonist is present, resulting in activation of the 

5 heterotrimeric G protein to dissociate the Go; subunit from the 
G)8 and Gy subunits and conversion to the activated Gq;-GTP 
conformation 

4. When no antagonist is present, the D-peptide is 
released from the activated Gof/GTP or Gqj/GTPyS complex 

10 resulting in a decrease in the production of light by the 
scintillant that is in close proximity to the membrane /Gor/D- 
peptide complex as detected by a scintillation counting. 
i3 5. Compounds to be assayed are incubated in the well to 

'I allow an antagonist to bind to the GPCR. 

fU 15 6 . The presence of an antagonist is detected by the 

='1 increased production of light by the membrane/heterotrimeric 

IJ G protein/GDP/D-peptide complex. 

s 
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Type 2 : FP Assays 

Protocol 2.1. Detection of agonist binding using a T-peptide 

1. Membranes containing the GPCR to be assayed are 
incubated with possible agonist compounds. 

5 2 . Activation of the GPCR by an agonist results in 

activation of the heterotrimeric G protein to dissociate the 
Got subunit from the GjS and Gy subunits and conversion to the 
activated Gof-GTP conformation. 

3 . A f luorescently labeled T-peptide can then bind to the 
10 activated Gck!/GTP or Ga/CTPyS complex. Many fluorescent labels 
can be used such as, but not limited to, fluorescein, 
rhodamine, texas red, Cy-3, and Oregon green. 

4. The binding of the peptide results in polarization of 
the fluorescent light emitted by the fluorescent peptide in the 

15 membrane/Gof/T-peptide complex. 

5 . An agonist is detected by an increase in- the 
polarization of emitted light. 

Protocol 2.2. Detection of antagonist binding using a T- 
peptide 

20 1. Membranes containing the GPCR to be assayed are 

incubated with possible antagonist compounds. 

2. An agonist is added to produce and activation of the 
GPCR resulting in activation of the heterotrimeric G protein 
to dissociate the Got subunit from the G^S and Gy subunits and 

25 conversion to the activated Gof-GTP conformation. 

3 . A f luorescently labeled T-peptide can then bind to the 
activated Gof/GTP or Gof/CTPyS complex. 

4. The binding of the peptide results in polarization of 
the fluorescent light emitted by the fluorescent peptide in the 

3 0 membrane/Ga/T-pept ide complex. 

5. An antagonist is detected by the lack of polarization 
of the T-peptide indicating that the Go? was not activated. 

Protocol 2.3. Detection of agonist binding using a D-peptide 
1 . A membrane/heterotrimeric G protein/GDP complex 
35 containing the GPCR to be assayed is incubated with possible 
agonist compounds . and a . f luorescently labeled D-peptide that 
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will bind to to the Ga/GDP complex. 

2. The bound peptide will produce emitted light that is 
polarized . 

3 . Activation of the GPCR by an agonist results in 
5 activation of the heterotrimeric G protein to dissociate the 

Gof subunit from the G/8 and Gy subunits and conversion to the 
activated Ga-GTP conformation. 

4. The f luorescently labeled D-peptide will not bind to 
the activated Gof/GTP or Ga/GTPyS complex and no longer emits 

10 polarized light . 

5. An agonist is detected by the loss of polarized light. 

Protocol 2.4. Detection of antagonist binding using a D- 
peptide 

1. A membrane/heterotrimeric G protein/GDP complex 
15 containing the GPCR to be assayed is incubated with possible 

antagonist compounds . 

2 . An agonist is added to activate the GPCR resulting in 
activation of the heterotrimeric G protein to dissociate the 
Go? subunit from the G^ and G7 subunits and conversion to the 

20 activated Ga-GTP conformation. 

3. The f luorescently labeled D-peptide that will bind to 
to the Gof/GDP complex is then added to the activated receptor/G 
protein in membranes. When an antagonist is present, the 
f luorescently labeled D-peptide is not released from the 

2 5 membrane/heterotrimeric G protein/GDP complex. 

4 . An antagonist is detected by the production of 
polarized light indicating that the Go? was not activated. 

Protocol 2.5. Detection of agonists using both a D-peptide and 
T -peptide simultaneously 

3 0 1. A membrane/heterotrimeric G protein/GDP complex 

containing the GPCR to be assayed is incubated with possible 
agonist compounds, a f luorescently labeled D-peptide that will 
bind to to the Gof/GDP complex and a T-peptide that is labeled 
with a fluorescent group with excitation and emission 
35 wavelengths that are different from the D-peptide label. 

2 . Activation of the GPCR by an agonist results in 
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activation of the heterotrimeric G protein to dissociate the 
Gee subunit from the Gf3 and Gy subunits and conversion to the 
activated Gce-GTP conformation. 

3, When an agonist is present, the f luorescently labeled 
5 D-peptide is released from the membrane/Gof/GTP complex with the 

concurrent binding of the T-peptide to the membrane /Gof/GTP 
complex . 

4 . An agonist is detected by the loss of polarized light 
at the emission wavelength of the D-peptide and the gain in 

10 polarization at the emission wavelength of the T-peptide. 

Protocol 2.6. Simultaneous detection of antagonists using both 
a D-peptide and T-peptide 

1. A membrane/heterotrimeric G protein/GDP complex 
containing the GPCR to be assayed is incubated with possible 

15 antagonist compounds, a f luorescently labeled D-peptide that 
will bind to to the Gof/GDP complex and a T-peptide that is 
labeled with a fluorescent group with excitation and emission 
wavelengths that are different from the D-peptide label. 

2. An agonist is added to activate the GPCR resulting in 
20 activation of the heterotrimeric G protein to dissociate the 

Go; subunit from the Gj8 and Gy subunits and conversion to the 
activated Ga-GTP conformation. 

3. When an antagonist is present, the f luorescently 
labeled D-peptide binds to the membrane/Gof/GDP complex and the 

25 T-peptide does not bind to that complex. 

4. Antagonists are detected by retention of polarized 
light at the emission wavelength of the D-peptide and no gain 
in polarization at the emission wavelength of the T-peptide. 



201 

Type 3 : Ligand/Surrogate Ligand Binding Assays 

This type differs from Type 2 only in the detection 
method . 

Protocol 3.1. Detection of agonist binding using a T-peptide 
5 1. Membranes containing GPCR to be assayed are immobilized 

on the surface of a microtiter plate or immobilized by another 
method (ie beads) and incubated with possible agonist 
compounds . 

2 . Activation of the GPCR by an agonist results in 
10 activation of the heterotrimeric G protein to dissociate the 
Go; subunit from the GjS and Gy subunits and conversion to the 
activated Ga-GTP conformation. 

3 . A labeled T-peptide is added and incubated so that it 
can bind to the activated Gor/GTP or Gof/GTPyS complex, many 

15 labels can be used such as, but not limited to, fluor*esent 
moieties, radioactivity, or enzymes such as alkaline 
phosphatase or i8-galactosidase . 

4 . After incubation of the immobilized membrane/GPCR/Go; 
complex is washed and the amount of labeled peptide that binds 

20 to the plate is quantitated by fluorescence, scintillation 
counting, luminescence, or spectrophotometry. 

5. An agonist is detected by an increase in the quantified 
signal indicating that the GPCR was activated. . 

Protocol 3.2. Detection of antagonist binding using a T- 
25 peptide 

1. Membranes containing GPCR to be assayed are immobilized 
on the surface of a microtiter plate or immobilized by another 
method (ie beads) and incubated with possible antagonist 
compounds . 

3 0 2. After incubation with possible antagonists, an agonist 

and a lableled T-peptide are added to allow for activation of 
the GPCR and activation of the heterotrimeric G protein to 
dissociate the Go; subunit from the G/3 and Gy subunits and 
conversion to the activated Gof-GTP conformation. 

35 3. The labeled T-peptide can then bind to the activated 
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Ga/GTP or Gof/GTPyS complex, many labels can be used such as, 
but not limited to, fluoresent moieties, radioactivity, or 
enzymes such as alkaline phosphatase or jS-galactosidase . 

4. After incubation of the immobilized membrane /GPCR/Gof 
5 complex is washed and the amount of labeled peptide that binds 

to the plate is quant itated by fluorescence, scintillation 
counting, luminescence, or spectrophotometry. 

5. An antagonist is detected by a decrease in the 
quantified signal indicating that the GPCR was not activated. 

10 Protocol 3.3. Detection of agonist binding using a D-peptide 
1. Membranes containing the GPCR to be assayed are 
immobilized on the surface of a microtiter plate or immobilized 
by another method (ie. beads) and incubated with possible 
agonist compounds . 

15 2. Activation of the GPCR by an agonist results in 

activation of the heterotrimeric G protein to dissociate the 
Ga subunit from the GjS and Gy subunits and conversion to the 
activated Ga-GTP conformation. 

3. A labeled D-peptide is added and incubated so that it 
2 0 can bind to the inactive heterotrimeric G protein complex, many 

labels can be used such as, but not limited to, fluoresent 
moieties, radioactivity, or enzymes such as alkaline 
phosphatase or j8-galactosidase . 

4. After incubation of the immobilized membrane/GPCR/Gof 

2 5 complex is washed and the amount of labeled peptide that binds 

to the plate is quantitated by fluorescence, scintillation 
counting, luminescence, or spectrophotometry. 

5. An agonist is detected by a decrease in the quantified 
signal indicating that the GPCR was activated. 

30 Protocol 3.4. Detection of antagonist binding using a D- 
peptide 

1. Membranes containing GPCR to be assayed are immobilized 
on the surface of a microtiter plate or immobilized by another 
method (ie beads) and incubated with possible antagonist 

3 5 compounds . 

2. After incubation with possible antagonists, an agonist 
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and a lableled D-peptide are added to allow for activation of 
the GPCR and activation of the heterotrimeric G protein to 
dissociate the G<y subunit from the GjS and Gy subunits and 
conversion to the activated Gof-GTP conformation. 
5 3 . A labeled D-peptide is added and incubated so that it 

can bind to the inactive heterotrimeric G protein complex, many 
labels can be used such as, but not limited to, fluoresent 
moieties, radioactivity, or enzymes such as alkaline 
phosphatase or /S-galactosidase . 

10 4. After incubation of the immobilized membrane /GPCR/Gof 

complex is washed and the amount of labeled peptide that binds 
to the plate is quantitated by fluorescence, scintillation 
counting, luminescence, or spectrophotometry. 

5. An antagonist is detected by an increase in the 

15 quantified signal indicating that the GPCR was not activated. 
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Type 4 : FRET Assays 

Protocol 4.1. Detection of agonist binding using a T-peptide 
and an I -peptide 

1 . Membranes containing the GPCR to be assayed are 
5 incubated with possible agonist compounds. 

2 . Activation of the GPCR by an agonist results in 
activation of the heterotrimeric G protein to dissociate the 
Gof subunit from the G(3 and Gy subunits and conversion to the 
activated Gof-GTP conformation. 
10 3 . A f luorescently labeled T-peptide and a f luorescently 

labeled I -peptide with fluorophores matched for FRET are added. 

4. The T-peptide will bind to the activated Ga/GTP or 
GQf/GTP7S complex and the I -peptide will bind to the either the 
Gof/GTP, Gqj/GTPyS or Gof/GDP complex. 
15 5. When both peptides are bound to the Gof/GTP or GQf/GTP7S 

complex excitation of the donor fluorophore results in the 
emission of light from the acceptor fluorophore. 

6. An agonist is detected by an increase in the emission 
of light from the acceptor fluorophore. 

20 Protocol 4.2.. Detection of antagonist binding using a T- 
peptide and an I -peptide 

1. Membranes containing the GPCR to be assayed are 
incubated with possible antagonist compounds. 

2 . An agonist is added to produce and activation of the 
25 GPCR resulting in activation of the heterotrimeric G protein 

to dissociate the Go; subunit from the G^ and Gy subunits and 
conversion to the activated Gof-GTP conformation. 

3. A f luorescently labeled T-peptide and a f luorescently 
labeled I -peptide with fluorophores matched for FRET are added. 

30 4. The T-peptide will bind to the activated Gof/GTP or 

GQf/GTPyS complex and the I -peptide will bind to the either the 
Goi/GTP, Gof/GTPYS or Gq;/GDP complex. 

5 . When both peptides are bound to the Gof/GTP or Gof/GTPyS 
complex excitation of the donor fluorophore results in the 

35 emission of light from the acceptor fluorophore. 

6 . An antagonist is detected when no emission of light 
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from the acceptor f luorophore occurs . 

Protocol 4.3. Detection of agonist binding using a T-peptide 
and a Goj/f luorescent protein fusion 

1 . Membranes containing the GPCR to be assayed and a 
5 heterotrimeric G protein containing a Gof/ fluorescent protein 

fusion are incubated with possible agonist compounds. 

2. Activation of the GPCR by an agonist results in 
activation of the heterotrimeric G protein to dissociate the 
Go? subunit from the Gj8 and Gy subunits and conversion to the 

10 activated Gof-GTP conformation. 

3 . A T-peptide that is f luorescently labeled with a 
f luorophore whose excitation or emission wavelength matches the 
fluorescent protein fused to the Gof for FRET is added. 

4 . The T-peptide will bind to the activated Gof/GTP or 
15 Ga/GTPyS complex. 

5. When the T-peptide is bound to the Ga/GTP or Ga/CTPyS 
complex, excitation of the donor fluorophore results in the 
emission of light from the acceptor fluorophore. 

6 . An agonist is detected by an increase in the emission 
20 of light from the acceptor fluorophore. 

Protocol 4.4. Detection of antagonist binding using a T- 
peptide and a Gof/f luorescent protein fusion 

1. Membranes containing the GPCR to be assayed and a 
heterotrimeric G protein containing a Ga/f luorescent protein 

25 fusion are incubated with possible antagonist compounds. 

2. An agonist is added to activate the GPCR resulting in 
activation of the heterotrimeric G protein to dissociate the 
Go; subunit from the GjS and G7 subunits and conversion to the 
activated Ga-GTP conformation. 

30 3. A f luorescently labeled T-peptide and a f luorescently 

labeled I -peptide with fluorophores matched for FRET are added. 

4 . A T-peptide that is f luorescently labeled with a 
fluorophore whose excitation or emission wavelength matches the 
fluorescent protein fused to the Go; for FRET is added. 

35 5. The T-peptide will bind to the activated Gof/GTP or 

GQf/GTP7S complex. 
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6 . When the T-peptide is bound to the Gof/GTP or Gq;/GTPyS 
complex, excitation of the donor fluorophore results in the 
emission of light from the acceptor fluorophore. 

7. An antagonist is detected when no emission of light 
5 from the acceptor fluorophore occurs. 

Protocol 4.5. Detection of agonist binding using a T-peptide 
and a biotinylated Go? protein 

1 . Membranes containing the GPCR to be assayed and a 
heterotrimeric G protein containing a biotinylated Go; protein 

10 (produced by either ±n vitro or in vivo biotinylation methods) 
are incubated with possible agonist compounds. 

2. Activation of the GPCR by an agonist results in 
activation of the heterotrimeric G protein to dissociate the 
Ga subunit from the G/3 and Gy subunits and conversion to the 

15 activated Ga-GTP conformation. 

3. A f luorescently labeled T-peptide and f luorescently 
labeled streptavidin (can also include neutravidin or avidin)' 
with fluorophores matched for FRET are added. 

4. The T-peptide will bind to the activated Ga/GTP or 

2 0 Ga/GTPyS complex and the steptavidin will bind to the 

biotinylated Gor . 

5. When the T-peptide is bound to the Ga/GTP or Gckf/GTPyS 
complex, excitation of the donor fluorophore results in the 
emission of light from the acceptor fluorophore. 

25 6 . An agonist is detected by an increase in the emission 

of light from the acceptor fluorophore. 

Protocol 4.6. Detection of an antagonist binding using a T- 
peptide and a biotinylated Go; protein 

1. Membranes containing the GPCR to be assayed and a 

3 0 heterotrimeric G protein containing a biotinylated Ga protein 

(produced by either in vitro or in vivo biotinylation methods) 
are incubated with possible antagonist compounds. 

2. An agonist is added to activate the GPCR resulting in 
activation of the heterotrimeric G protein to dissociate the 

3 5 Gof subunit from the G/3 and Gy subunits and conversion to the 
activated Ga-GTP conformation. 
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3. A f luorescently labeled T-peptide and f luorescently 
labeled streptavidin (can also include neutravidin or avidin) 
with f luorophores matched for FRET are added . 

4. The T-peptide will bind to the activated Ga/CTP or 
5 Gof/GTPyS complex. 

5. When the T-peptide is bound to the Gof/GTP or Gof/CTPyS 
complex, excitation of the donor fluorophore results in the 
emission of light from the acceptor fluorophore. 

6. An antagonist is detected when no emission of light 
10 from the acceptor fluorophore occurs . 

Protocol 4.7. Detection of agonist binding using a T-peptide 
and fluorescent membranes 

1. Membranes containing the GPCR to be assayed and a 
heterotrimeric G protein are labeled with a fluorescent dye and 

15 then incubated with possible agonist compounds. 

2. Activation of the GPCR by an agonist results in 
activation of the heterotrimeric G protein to dissociate the 
Go; subunit from the GjS and Gy subunits and conversion to the 
activated Gce-GTP conformation. 

20 3. A T-peptide labeled with a fluorophore matched for FRET 

with the membrane dye is added. 

4. The T-peptide will bind to the activated Gq?/GTP or 
Ga/GTPyS complex . 

5. When the T-peptide is bound to the Ga/GTP or Gor/GTPyS 
25 complex, excitation of the donor fluorophore results in the 

emission of light from the acceptor fluorophore . 

6. An agonist is detected by an increase in the emission 
of light from the acceptor fluorophore. 

Protocol 4.8. Detection of an antagonist binding using a T- 
3 0 peptide and fluorescent membranes 

1. Membranes containing the GPCR to be assayed and a 
heterotrimeric G protein are labeled with a fluorescent dye and 
then incubated with possible antagonist compounds. 

2. An agonist is added to activate the GPCR resulting in 
35 activation of the heterotrimeric G protein to dissociate the 

Go; subunit from the GjS and G7 subunits and conversion to the 
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activated Gof-GTP conformation. 

3 . A T-peptide labeled with a f luorophore matched for FRET 
with the membrane dye is added. 

4. The T-peptide will bind to the activated Ga/GTP or 
5 Ga/GTP7S complex . 

5. When the T-peptide is bound to the Gof/GTP or Gcv/GTPyS 
complex, excitation of the donor fluorophore results in the 
emission of light from the acceptor fluorophore. 

6. An antagonist is detected when no emission of light 
10 from the acceptor fluorophore occurs. 

Protocol 4.9. Detection of agonist binding using a D-peptide 
and an I-peptide. 

1 . Membranes containing the GPCR to be assayed are 
incubated with possible agonist compounds. 
15 2. A f luorescently labeled D-peptide and a fluorescent ly 

labeled I-peptide with fluorophores matched for FRET are added. 

3. The D-peptide will bind to the inactive Ga/GDP complex 
and the I-peptide will bind to the either the Gq?/GTP, Gqt/GTPyS 
or Gq?/GDP complex. 
20 4. When both peptides are bound to the inactive Ga/GDP 

complex excitation of the donor fluorophore results in the 
emission of light from the acceptor fluorophore. 

5. Activation of the GPCR by an agonist results in 
activation of the heterotrimeric G protein to dissociate the 

2 5 Gof subunit from the GjS and Gy subunits, conversion to the 
activated Gof-GTP conformation, displacement of the D-peptide 
and loss of FRET. 

6. An agonist is detected by a decrease in the emission 
of light from the acceptor fluorophore. 

30 Protocol 4.10. Detection of antagonist binding using a D- 
peptide and an I-peptide 

1. Membranes containing the GPCR to be assayed are 
incubated with possible antagonist compounds. 

2. An agonist is added to produce and activation of the 
35 GPCR resulting in activation of the heterotrimeric G protein 
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to dissociate the Gor subunit from the GjS and G7 subunits and 
conversion to the activated Goi-GTP conformation. 

3 . A f luorescently labeled D-peptide and a f luorescently 
labeled I -peptide with fluorophores matched for FRET are added. 
5 4 . The D-peptide will bind to the inactive Gof/GDP complex 

and the I -peptide will bind to the either the Gq?/GTP, Ga/GTP7S 
or Gof/GDP complex. 

5. When both peptides are bound to the inactive Gof/GDP 
complex excitation of the donor fluorophore results in the 

10 emission of light from the acceptor fluorophore. 

6 . Activation of the GPCR by the agonist results in 
activation of the heterotrimeric G protein to dissociate the 
Ga subunit from the GjS and Gy subunits, conversion to the 

^^3 activated Gq?-GTP conformation, displacement of the D-peptide 

fy 15 and loss of FRET. 

%i 7. An antagonist is detected when light is emission from 

l"l the acceptor fluorophore occurs. 

Protocol 4.11, Detection of agonist binding using a D-peptide 
12 and a Gof/f luorescent protein fusion 

l}^ 20 1. Membranes containing the GPCR to be assayed and a 

heterotrimeric G protein containing a Gof/f luorescent protein 
^=3 fusion are incubated with possible agonist compounds. 

2 . A D-peptide that is f luorescently labeled with a 
fluorophore whose excitation or emission wavelength matches the 

25 fluorescent protein fused to the Gof for FRET is added. 

3. The D-peptide will bind to the inactive Gof/GDP complex. 

4 . When the D-peptide is bound to the inactive 
f luorescent-Gce/GDP complex excitation of the donor fluorophore 
results in the emission of light from the acceptor fluorophore. 

30 5. Activation of the GPCR by an agonist results in 

activation of the heterotrimeric G protein to dissociate the 
Ga subunit from the GjS and G7 subunits, conversion to the 
activated Gof-GTP conformation, displacement of the D-peptide 
and loss of FRET. 

35 6. An agonist is detected by a decrease in the emission 

of light from the acceptor fluorophore. 
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Protocol 4.12. Detection of antagonist binding using a D- 
peptide and a Gof/f luorescent protein fusion 

1 . Membranes containing the GPCR to be assayed and a 
heterotrimeric G protein containing a Gof/f luorescent protein 

5 fusion are incubated with possible antagonist compounds. 

2. An agonist is added to activate the GPCR resulting in 
activation of the heterotrimeric G protein to dissociate the 
Gof subunit from the GjS and Gy subunits and conversion to the 
activated Gof-GTP conformation. 

10 3. A D-peptide that, is f luorescently labeled with a 

f luorophore whose excitation or emission wavelength matches the 
fluorescent protein fused to the Ga for FRET is added. 

4. The D-peptide will bind to the inactive Gof/GDP complex. 
%1 5. When D-peptide is bound to the inactive fluorescent- 

IJl 15 Gof/GDP complex excitation of the donor fluorophore results in 

%J the emission of light from the acceptor fluorophore. 

6. Activation of the GPCR by the agonist results in 
activation of the heterotrimeric G protein to dissociate the 

^ Gof subunit from the G/S and Gy subunits, conversion to the 

j=3 20 activated Gqj-GTP conformation, displacement of the D-peptide 

[U and loss of FRET. 

7. An antagonist is detected when light is emission from 
^3 the acceptor fluorophore occurs. 

Protocol 4.13. Detection of agonist binding using a D-peptide 

25 and a biotinylated Ga protein 

1 . Membranes containing the GPCR to be assayed and a 
heterotrimeric G protein containing a biotinylated Gof protein 
(produced by either in vitro or in vivo biot inylation methods) 
are incubated with possible agonist compounds. 

30 2. A f luorescently labeled D-peptide and fluorescent ly 

labeled streptavidin (can also include neutravidin or avidin) 
with fluorophores matched for FRET are added. 

3. The D-peptide will bind to the inactive Gq;/GDP complex. 

4. When both peptides are bound to the inactive Gce/GDP 
35 complex excitation of the donor fluorophore results in the 

emission of light from the acceptor fluorophore. 

5. Activation of the GPCR by an agonist results in 
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activation of the heterotrimeric G protein to dissociate the 
Ga subunit from the G(3 and Gy subunits, conversion to the 
activated Gof-GTP conformation, displacement of the D-peptide 
and loss of FRET. 
5 6. An agonist is detected by a decrease in the emission 

of light from the acceptor fluorophore. 

Protocol 4.14. Detection of an antagonist binding using a D- 
peptide and a biotinylated Go; protein 

1. Membranes containing the GPCR to be assayed and a 
10 heterotrimeric G protein containing a biotinylated Gof protein 

(produced by either in vitro or in vivo biotinylation methods) 
are incubated with possible antagonist compounds, 

2. An agonist is added to activate the GPCR resulting in 
activation of the heterotrimeric G protein to dissociate the 

15 Ga subunit from the G(3 and Gy subunits and conversion to the 

activated Ga-GTP conformation. 

3 . A f luorescently labeled D-peptide and fluorescent ly 

labeled streptavidin (can also include neutravidin or avidin) 

with fluorophores matched for FRET are added. 
20 4. The D-peptide will bind to the inactive Ga/GDF 

complex . . 

5. When both peptides are bound to the inactive Gof/GDP 
complex excitation of the donor fluorophore results in the 
emission of light from the acceptor fluorophore. 

25 6. Activation of the GPCR by the agonist results in 

activation of the heterotrimeric G protein to dissociate the 
Ga subunit from the GjS and G7 subunits, conversion to the 
activated Ga-GTP conformation, displacement of the D-peptide 
and loss of FRET. 

30 7. An antagonist is detected when light is emission from 

the acceptor fluorophore occurs. 

Protocol 4.15. Detection of agonist binding using a D-peptide 
and fluorescent membranes 

1. Membranes containing the GPCR to be assayed and a 
35 heterotrimeric G protein are labeled with a fluorescent dye and 
then incubated with possible agonist compounds. 
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2. A D-peptide labeled with a fluorophore matched for FRET 
with the membrane dye is added. 

3. The D-peptide will bind to the inactive Gckf/GDP 
complex. . 



complex excitation of the donor fluorophore results in the 
emission of light from the acceptor fluorophore. 

5. Activation of the GPCR by an agonist results in 
activation of the heterotrimeric G protein to dissociate the 

10 Ga subunit from the G(3 and G7 subunits, conversion to the 
activated Ga-GT? conformation, displacement of the D-peptide 
and loss of FRET. 

6 . An agonist is detected by a decrease in the emission 
of light from the acceptor fluorophore. 

15 Protocol 4.16. Detection of an antagonist binding using a D- 
peptide and fluorescent membranes 

1. Membranes containing the GPCR to be assayed and a 
heterotrimeric G protein are labeled with a fluorescent dye and 
then incubated with possible antagonist compounds. 

20 2. An agonist is added to activate the GPCR resulting in 

activation of the heterotrimeric G protein to dissociate the 
Go: subunit from the Gf3 and G7 subunits and conversion to the 
activated Ga-GTP conformation. 



2 5 with the membrane dye is added. 

4, The D-peptide will bind, to the inactive Ga/GDP complex. 

5. When both peptides are bound to the inactive Gq;/GDP 
complex excitation of the donor fluorophore results in the 
emission of light from the acceptor fluorophore. 

30 6. Activation of the GPCR by the agonist results in 

activation of the heterotrimeric G protein to dissociate the 
Ga subunit from the G^ and Gy subunits, conversion to the 
activated Gof-GTP conformation, displacement of the D-peptide 
and loss of FRET. 

35 7. An antagonist is detected when light is emission from 

the acceptor fluorophore occurs. 



5 



4. When both peptides are bound to the inactive Gof/GDP 



3 . A D-peptide labeled with a fluorophore matched for FRET 
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Example 203 : Cell -Based Domain Complementation Assays Using GDP- 
dependent, GTP- dependent and/ or GDP/GTP- independent Peptides 
for the Detection of Agonists and Antagonists of G-Protein 
Coupled Receptors 

5 Certain cell-based assays for the detection of agonists 

and antagonists of estrogen receptors are described in Example 
4 and are readily adapted to the detection of such modulators 
of G-protein coupled receptors. 

Here we describe a domain complementation assay. The 

10 basic premise for the domain complementation approach is to 
take the gene that codes for a reporter molecule (in most cases 
an enzyme) , split it into two fragments (A' and B' ) and encode 
them in separate cistrons. This is done in such a way that the 
encoded fragments A and B do not by themselves interact and 

15 reconstitute a functional protein. To assemble an active 
protein, a gene encoding a fusion between fragment A and target 
protein (A: TP) is constructed and a second gene encoding a 
fusion between fragment B and a ligand (B:L) is constructed. 
If L and TP interact, they serve as the molecular glue to bring 

2 0 fragments A and B together, allowing for the reconstitution of 

activity. This activity can then be used as a read out for the 
interaction of the ligand with the target protein. Reporter 
genes that have been used in this manner are DHFR (Pelletier, 
J.N., F.X. Campbell -Valois, and S.W. Michnick, Proc Natl Acad 
25 Sci USA, 1998. 95(21): p. 12141-6, see WO9834120A1) , 
adenylate cyclase (Karimova et al . , PNAS 1998), jS-galactosidase 
(Rossi et al . PNAS 1997, Mohler et al, WO98/06648 and Proc. 
Nat. Acad. Sci. (USA), 93:12423-7 (October 29, 1996). 

Many other proteins may be used in this way and this 

3 0 example is intended to show the concept of using this technique 

for GPCRs, not limit the scope of reporter molecules. For the 
rest of this example DHFR is used. DHFR activity can be easily 
monitored in intact cells using fluorescein methotrexate . This 
fluorescent molecule readily enters cells, however, it is very 
35 efficiently pumped out by a membrane pump. Fluorescein 
methotrexate will bind to active DHFR inside the cells and is 
then retained, resulting in cells that are fluorescent. 
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Pelletier, et al . , Proc . Nat. Acad. Sci . (USA), 95:12141-6 
(October 13, 1998) . 

First a fusion between DHFRA and is made and a fusion 
between DHFRB and a T peptide. Stable or transiently 
5 transfected cell lines are then established that express both 
of these fusions. Ideally, the cell type chosen would not have 
a GPCR that would interact with the DHFR:G„ fusion, however, 
this would not be a requirement of the system. The cell lines 
would then be tested for background levels of DHFR activity. 

10 One would expect the level to be very low in the absence of a 
GPCR that would activate the G^, . Once a cell line with low 
background activity is established, a expression construct 
encoding a GPCR would then be introduced and additional cell 
lines established that now expressed all three proteins 

15 (DHFRA:G„ , DHFRB : T-pept i de , and the GPCR of interest) . 
Addition of agonist to the culture media of these cells will 
result in the activation of the GPCR, which will in turn 
activate the DHFRA :Gq, fusion causing it to release GDP and bind 
GTP. The conformational changes that accompany this activation 

2 0 and exchange of nucleoside will allow the DHFRB : T-pept ide to 
recognize and bind to the DHFRAcG^, fusion. This will result in 
bring DHFRA and DHFRB together, allowing them to reconstitute 
active DHFR. These cells will then selectively retain 
fluorescein methotrexate which is readily detectable by a 

25 number of methods, fluorimeter, fluorescence microscopy, 
fluorescence activated cell sorting (FACS) to mention the most 
widely used. This is a cell based screen for agonists of the 
GPCR introduced into the cell line. Appropriate controls for 
potential active compounds include testing the compounds on the 

30 cell lines lacking the GPCR and on cell lines lacking the 
DHFRB : T-pept ide and DHFRA :G^ fusions (no activation should be 
observed in either case) . Specificity of the potential 
agonists can be tested on cell lines expressing a different 
GPCR (again, no activation should be observed) . 

35 It is also possible to configure this system to detect 

antagonists. In this case, an agonist is added, preferably in 
subsaturating concentration, such as concentrations that will 
give approximately 50%-75%- activation of the receptor, . and 
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hence 50%-75% maximal fluorescence. Potential antagonists are 
then added at high concentrations and a drop in the 
fluorescence intensity should be observed when an antagonist 
is present. Appropriate controls for these experiments would 
5 be testing these compounds on cells with a different GPCR 
expressed as well as cells that express full length active DHFR 
(no reduction in fluorescence should be observed in either 
case) . 

Example 203 . 1 : Detection of agonists of the M2 muscarinic 
10 receptor in CHO cells 

Chinese hamster ovary (CHO) cells expressing the Gi 
Q coupled acetylcholine M2 muscarinic receptor would be used to 

screen for agonists. Alternatively Gofi could be coexpressed 
fy in CHO cells with the M2 receptor to increase expression 

^-3 15 levels. These cells contain endogenous Giof which is the 

cognate G protein for Wl muscarinic receptors. The signaling 
H protein will be an engineered version of the Ras protein fused 

to a T-peptide. This Ras fusion would contain a single amino 
Cj acid substitution mutation that makes it const ituitively 

1^ 2 0 active, a large number of which have been extensively 

,§ characterized and can be at a number of different residues. 

^.3 In addition, this Ras fusion would have a mutation in its CAAX 

box, located at the C terminus, that would remove its ability 
to localize to the cell membrane. Ras must be localized to the 
25 membrane in order to activate downstream signaling pathways. 
The T peptide-Ras fusion will be located in the cytosol and 
thus will not initiate a signal unless brought to the membrane. 
In the absence of agonist, Giof will exist in the GDP form 
and the T peptide-Ras fusion will remain localized in the 
3 0 cytosol. Upon activation of the GPCR by an agonist, which will 
cause Giof to exchange GDP for GTP, the GTP bound Gia subunits 
will attract the T peptide-Ras fusion to the membrane. Once 
localized to the membrane, the transforming Ras will initiate 
signaling through the Map Kinase pathway resulting in increased 
35 transcription on promoters containing AP-1 response elements. 
A reporter construct carrying multiple AP-1 response elements 
fused to luciferase will be used to monitor activation of this 
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pathway. Thus in the presence of agonist, an increase in 
lucif erase activity will be observed. 

To insure that this signal is specific, a system in which 
the T Peptide-Ras fusion has a mutation that prevents binding 
5 to the Gia will be utilized. In this case, potential agonists 
should not increase the activity of the AP-1 lucif erase 
reporter . 

Example 203.2: Detection of antagonists of the M2 
muscarinic receptor in CHO cells 
10 Utilizing the system described above, we can detect 

agonists for the same receptor. The experimental system is the 
same. In this case an agonist will be used at a concentration 
£j that produces half maximal induction of lucif erase activity. 

W It is important that the concentration and time dependence of 

fy 15 the agonist being used is carefully titrated to determine the 

h3 time and concentration needed to produce a linear response to 

n agonist. At 50% maximal agonist concentration, any blocking 

1==^ of binding to the receptor by antagonist will result in a 

reduction in lucif erase activity. 
□ 20 Cells would first be incubated in the presence of high 

^Ji concentrations of potential antagonists. After one hour, 

3 agonist will be added and the cells will be incubated for the 

=3 time required to provide a linear response to this 

concentration of agonist. After this time, luciferase activity 
25 will be monitored. If a compound prevents activation of the 
M2 muscarinic receptor, less of the Ga will exist in the GTP 
form and hence less of the T-peptide Ras fusion will be 
localized to the membrane. This will result in less signaling 
through the Map Kinase pathway and less AP-1 activity, 
3 0 resulting in lower transcription of the luciferase reporter and 
lower luciferase activity. 
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Example 204: Cell Based ''''Two -Hybrid'' FRET Assay Using GTP- 
dependent Peptides for the Detection of Agonists and 
Antagonists of G-Protein Coupled Receptors 

Fluorescence Resonance Energy Transfer (FRET) is a method 
5 for detection of complex formation, such as ligand-receptor 
binding, that relies upon the through-space interactions 
between two fluorescent- groups . A fluorescent molecule has a 
specific wavelength for excitation and another wavelength for 
emission. Pairs of fluorophores are selected that have an 

10 overlapping emission and excitation wavelength. Paired 
fluorophores are detected by a through-space interaction 
referred to as resonance energy transfer. When a donor 
fluorophore is excited by light, it would normally emit light 
at a higher wavelength; however, during FRET energy is 

15 transferred to the acceptor fluorophore allowing the excited 
donor to relax to the ground state without emission of a 
photon. The acceptor fluorophore becomes excited and release 
energy by emitting light at its emission wavelength. This 
means that when a donor and an acceptor fluorophore are held 

20 in close proximity (<100 Angstroms) , when one fluorophore is 
attached to a ligand and one is attached to a receptor and the 
ligand binds to the receptor, excitation of the donor is 
coupled with emission from the acceptor. Conversely, if no 
complex is formed the excitation of the donor results in no 

25 emission from the acceptor. A common modification of this 
technique, sometimes referred to as fluorescence quenching, is 
accomplished using an acceptor group that is not fluorescent 
but efficiently accepts the energy from the donor fluorophore. 
In this case, when a complex is formed the excitation of the 

3 0 donor fluorophore is not accompanied by light emission at any 
wavelength. When this complex is dissociated the excitation 
of the donor results in emission of light at the wavelength of 
the donor. 

FRET is an important technique used to examine the 
35 activation state of proteins inside of cells using 
f luorescently-labeled proteins or peptides as probes. We can 
use FRET to determine the activation state of a GPCR in cells 
by directly monitoring the activation state of its cognate G 
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protein. Briefly, the presence of GTP-bound Go? subunits 
indicates that the cognate GPCR is in its active conformation 
and a GDP-bound Got subunit indicates an inactive cognate 
receptor. One could use f luorescently labeled peptide probes 
5 that bind specifically to the GTP or GDP-bound forms of Go; 
subunits. We have identified peptides that bind specifically 
to GTP and GDP-bound forms of Ga subunits (T-peptides and D- 
peptides, respectively) as well as peptides that bind 
independent of bound GDP or GTP (I -peptides) . To examine the 

10 effects that a compound may have on a GPCR, one needs an I- 
peptide and a T-peptide or D-peptide with f luorophores that are 
matched for FRET. Two specific examples of fluorescent probes 
that are matched for FRET include the green fluorescent protein 
(GFP) and the blue fluorescent protein (BFP) . The GFP or BFP 

15 probes could be fused to the D-, T- or I-peptides to monitor 
the activation state of the GPCR by FRET. Labeled peptides 
could be forced into cells by electroporat ion , liposomes, 
internalization sequences or by any other method that allows 
peptides to enter cells. The I-peptides will bind specifically 

20 to the Gey subunit that it was isolated from during phage 
display, independent of G protein activation state, and the T- 
peptides will only bind the same Gof subunit upon receptor 
activation and subsequent G protein activation (GTP-bound Ga 
conformation) . D-peptides will be bound to inactive 

25 heterotrimeric G proteins (cognate GPCR is inactive) . In a 
cell-based system where a GPCR is functionally coupled to its 
cognate G protein and both f luorescently- labeled I- and T- 
peptides are inside of the cells, addition of a receptor 
agonist will result in an increase in intensity of the emission 

3 0 wavelength signal of the acceptor. On the other hand, if I- 
and D-peptide probes are loaded into cells, addition of an 
agonist will result in a decrease in emission signal compared 
to the signal before agonist activation. Antagonists to a 
specific receptor could be identified in a cell containing 

35 f luorescently- labeled I-peptides and T-peptides by adding Kd 
concentrations of an agonist after the putative antagonist has 
bound the receptor. In this instance, FRET will not occur if 
the compound is an antagonist. As an agonist is added to 
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antagonist bound receptor, the intensity of the emission 
wavelength signal of the acceptor may increase. This increase 
in intensity occurs as the agonist displaces the antagonist at 
the ligand-binding site. However, high concentrations of an 
5 antagonist will suppress this signal. Another way to examine 
antagonists would be to load the cells with I-peptide and D- 
peptide probes then bind antagonist and agonist as described 
above. As an agonist is added to antagonist -bound receptor, 
the intensity of the emission wavelength signal of the acceptor 
10 will decrease. 

Another method for examining the activation state of a 
GPCR using FRET would be to use a fluorescent ly- labeled T- 
£3 peptide or D-peptide and a fluorescent ly- labeled Gor subunit . 

"""i Again GFP and BFP fusions matched for FRET could be used as the 

fU 15 fluorescent probes in these assays. If T-peptide fusions were 

forced into cells, activation of the GPCR by an agonist would 
Ij allow the T-peptide fusion to bind to the activated Gc^ subunit 

fusion resulting in FRET. This receptor activation would 
result in an increase in signal intensity as described 
P 20 previously. The presence of a bound agonist would also result 

L~ in a decrease in FRET signal intensity if a D-peptide fusion 

,3 and the Go? subunit fusion are present in cells. Antagonist 

"'^ binding to the receptor could be determined using these two 

fluorescent ly-labeled fusions as previously described. 
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Example 205: Cell Based ^^One Hybrid'' Assay Using GDP- 
dependent, GTP-dependent and/or GDP /OTP- independent Peptides 
for the Detection of Agonists and Antagonists of G- Protein 
Coupled Receptors 

5 The traditional yeast two hybrid system monitors 

protein/protein interactions in the nucleus by identifying 
proteins that bring together the DNA binding domain and the 
transcriptional activation domain of a transcription factor. 
Although this system is useful for examining some types of 
10 interactions, it is not applicable to interactions that involve 
transcriptional regulators, proteins modified in the cytoplasm, 
membrane associated proteins or integral membrane proteins. 

To monitor interactions between proteins localized to the 
l^, cytoplasmic membrane, a system which detects protein/protein 

IJ 15 interactions using engineered components of cell signaling 

12 pathways could be used. In the case of a G protein- coupled 

s receptor (GPCR) , this system would be set up in a cell line 

expressing the GPCR of interest, its cognate heterotrimeric G 

•a 3? 

fij protein and a modified signaling protein. To monitor the 

20 activation state of the GPCR, we will take advantage of the 
3 fact that the alpha subunit of the associated heterotrimeric 

G protein (GJ exists in either a GTP or GDP-bound form which 
correlates to the active and inactive forms of the receptor. 
The peptides from the previous examples have been shown to bind 

25 specifically to either the GTP form (T peptides) , GDP form (D 
peptides) or either form (I peptides) of G„. One of these 
peptides would be expressed fused to a mutant of signaling 
protein which lacks a functional membrane activator. A 
signaling protein is normally a membrane bound protein which 

3 0 is active substantially only at the membrane, whereupon it may, 
possibly in conjunction with other substances, cause the 
production of a detectable signal. 

When the GPCR is in the inactive state (i.e. no agonist 
bound) , a T peptide -membrane anchor- free signaling protein 

3 5 mutant fusion would be in the cytoplasm. When the GPCR is 
activated upon agonist binding, the Ga subunit binds GTP and 
thus the T peptide-signaling protein fusion would be localized 
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to the membrane by binding to the Gc,-GTP. Once at the 
membrane, the fusion protein would activate a signaling 
pathway. In this way the activity of the T peptide-signaling 
protein is dependent on its localization to the membrane. A 
5 reporter system would be set up so that the readout of the T 
peptide-signaling protein's activity could be measured. 

For detection of agonists and antagonists of other 
membrane receptors, we would develop biokeys specific to the 
active or inactive forms of these receptors, or of a second 

10 molecule (e.g., a second messenger) which changes form as a 
result of activation with these receptors. 

The Biokey would then be fused to a signalling protein (or 
signalling protein activator) . In the case of an active from- 
specific Biokey, activation of the receptor would read to 

15 binding of the Biokey to its binding partner (the activated 
receptor itself, or the activated accessory molecule) and hence 
to localization of the fused signalling protein moiety to the 
membrane, and consequent signal production. 

An example of a signalling protein would be a Ras protein 

2 0 mutated at the CAAX box so that it is not anchored at the 
membrane in a normal manner. Hence, T fused to a T-peptide, 
the T-peptide' s movement to and from the cell membrane would 
control signalling through the Ras pathway. The mutant would 
also preferably be constitutive, so that when the Ras was 

25 brought to the cell membrane, it would be active without the 
need for any further inducer, and produce a signal. Any 
signalling protein which needs to be at the cell membrane to 
be active can be modified in a similar manner by removing its 
membrane anchor and preferably making it constitut ively active 

30 at the membrane. The Biokey is acting in place of the normal 
anchor . 
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Example 3 00 

The Use of Gof Chimeras to Screen for agonists or antagonists 
of GPCRs with OTP and GDP Specific BioKeys 

The purpose of this prophetic example is to outline a 
5 basic screening strategy. 

It has been shown that individual G protein-coupled 
receptors interact with and signal through specific Ga protein 
subunits. For example^ the M2 muscarinic receptor is. coupled 
to G^a, resulting in inhibition of AC and decreased 
10 intracellular AMP levels. This receptor does not, however, 
interact with G^of or G^cx , In this way the hundreds of GPCRs 
that exist are linked to the relatively few Gas and the 
E3 appropriate signaling pathway and biological response. The 

3 specificity for this interaction is contained in the C-terminal 

15 portion of the Ga subunits. By switching as few as three C- 
lM terminal amino acids in a Go? protein one can switch the 

IJ receptor specificity of that G alpha without changing the 

downstream effectors that the Gcx or interact with. In one 
H specific example a G^a had three residues from a G^a exchanged 

H 20 at the C- terminus. This chimeric protein, which was mostly G^, 

rg then could interact with G protein-coupled receptors (GPCRs) 

^-3 normally associated Gi . See Conklin, et al . , Nature, 363:274- 

276 (1993) . Upon stimulation with agonist the G^ associated 
GPCR now can recruit the chimeric Ga, interacting with the G^ 
25 portion and signaling through the Gg portion. Upon stimulation 
of a receptor normally coupled to G^ with an agonist, Gg 
specific downstream effectors such as inositol phosphate or 
calcium flux were measured. 

Besides this Gqa/Gi2a chimera, Conklin describes others. 
30 in which the C-terminal is obtained from G^a, G^2^ and G^a . 
Fongetal., Molecul . Pharmacol., 54:249-57 (1998) constructed 
a chimera of G^^a and G^a, in which the last six amino acids of 
the former were replaced with the corresponding residues of the 
latter . 

35 A different permutation of this idea could be used to 

enhance screening for GPCRs using Ga subunit specific BioKeys. 
One Ga could be used to create a set of Ga chimeras with the 
same backbone but different.C terminal regions allowing the 
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chimeras to be coupled to all classes of receptors. For 
example, to use the Gckfq as the backbone we could take AA's 1 
through 355 (accession number AAB64301.1) and fuse it to the 
last five amino acids of Gofi, (residues 350-354, accession 
5 number P04 8 98) Gas (residues 390-394 , accession number P04895) , 
or GQfl3 (residues 373-377, accession number NP_006563 . 1) . 
(Conklin et al . Nature 363:274-276 (1993) . It is not necessary 
to use exactly 5 AA. You can use more or less in the range 
from 3-13, most preferably 4-9. In addition although every 

10 fusion has not been made or shown to work, due to the similar 
structure of Gof's it is suspected that this scheme will be 
successful. In this way, one set of BioKeys could be used to 
screen all G protein coupled receptors using one set of well 
characterized BioKeys. The portion of the Go; that the BioKeys 

15 bind to would be held constant in the chimeric proteins while 
the portion of the Ga binding to the receptor would be varied. 

A set of cell lines expressing such a battery of chimeric 
proteins could be made and used to screen receptors without 
knowing the proper cognate Go?. The chimeras would be expressed 

20 individually or in groups as some GPCRs interact with several 
Gor's, see Guderman, et al . , Ann. Rev. Neurosci, 20:399-427 
(1997) . The screens could be done with any of the techniques 
described in the other prophetic examples including 
fluorescence based assays, enzyme activity reconstitution 

25 assays, and those assays using cell signaling components. 

Specific example 

The Use of a set of G^cx Chimeras to Screen for Agonists or 
Antagonists of a GPCR that is not Normally Coupled to Giof 

Using phage display, peptides would be selected, by 
30 methods previously described, that were specific for Gqa. The 
phage would be isolated in the presence of G^a complexed with 
GTP7S, and separately with GgOf complexed to GDP. The resulting 
phage would be tested for specificity to Gof-GTP and Gof-GDP. 
They would then be classified into GTP, GDP, and independent 
35 phage (T, D, or I peptides respectively) . Gq chimeras would 
be made by switching the C- terminal 3-13 residues with all the 
other Gofs . Peptides .would then be made from the isolated phage 
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sequences. These peptides would be used in an assay, e.g., a 
fluorescence polarization assay, to make sure that the peptides 
retained the same specificity for the chimeric Gas as they 
displayed for G^a . We would expect this to be so as the 
5 GTP/GDP binding sites are removed from the C-terminus. Once 
this was established the Go: chimeras would be tested to assure 
they were associating with the receptor of interest. This will 
be done either by introducing the chimera into a membrane prep 
or by expressing the chimeric protein in a cell which expresses 

10 the GPCR being studied. Downstream signaling events of Gg such 
as an increase in inositol phosphate or calcium will be assayed 
after treatment with an agonist to check for proper coupling. 
I"? If the receptor and the chimeric Gas are interacting then 

screening can be set. up looking for novel agonists and 

15 antagonists as described in the other prophetic examples. 

i '■■ ^ 
s y 
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Example 4 01 Fingerprinting of Modulators of the Glucocorticoid 
Receptor 

Peptide Sequences 
F6 : GHEPLTLLERLLMDDKQAV 
5 Qf/ieill: SSWDMHQFFWEGVSR 
Of/iSV: SSPGSREWFKDMLSR 
Of II: SSLTSRDFGSWYASR 

Note that these peptides, while originally identified as 
peptides which bind the estrogen receptor, are usable in 

10 fingerprinting of modulators of the glucocorticoid receptor. 
The ER binding peptides work on the glucocorticoid receptor 
because nuclear receptors have structural similarities. The 
exact nature of these similarities are not known although there 
are sequence similarities. Identical coactivator proteins bind 

15 both receptor and contain LXXLL motifs. Thus it is not 
surprising that our LXXLL peptides might also bind both 
receptors in the presence of agonist. See Mclnerney, et al . , 
Genes & Development, 12:3357-68 (1998); Nolte, et al . , Nature, 
395:134-143 (September 10, 1998). 

2 0 Titration of GR vs. F6 with Deoxycorticosterone and 

Dexamethasone 

Yeast strain EGY48 (MATa trpl his3 ura 3 leu2 : : GLexAop- 
Leu2) was transformed with plasmids pjK103 (2/iM, 2LexAop-LacZ) , 
pJG4-5-F6 (2/xM, LexADBD-F6 peptide) , and pEG202-GR (2 juM, 
25 B42AD-Glucocorticoid Receptor a) . The resulting transformed 
strain was grown overnight in media containing galactose as the 
sole carbon source to induce expression of GR. 
Deoxycorticosterone and dexamethasone were serially diluted 
into 100 )Lil of media in a 96-well microplate. 100 fil aliquots 

3 0 of the overnight yeast culture were added to the microplate 

wells and incubated at 30^C for 3 hours. To monitor the 
interaction of the F6 peptide with GR, a kinetic assay for 
galactosidase activity was performed. The cell density in each 
was determined by reading the OD^^q. Yeast were pelleted by 
35 centrif ugation for 5 minutes at 3000 rpm and the media removed. 
2 0 ^1 of IxZ buffer (60 mM Na2HP04 4 0 mM NaH2P04 10 mM KCl 1 mM 
MgS04 7 mM 2 -mercaptoethanol ) containing 2.5% CHAPS detergent 
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was added and briefly mixed by aggitation. Following a 5 
minute incubation at room temperature, 100 /xl of IxZ buffer 
containing 4 0 jug o-Nitrophenyl jS-D-Galactopyranoside (ONPG) was 
added to each well . Color development was monitored by 
5 measuring the change in OD405 referenced to ODg^o over 10 minutes 
(20 second intervals) . j8-galactodase activity is expressed as 
A (OD405-OD650) /initial ODsso- 

Interaction of GR with peptides 

Yeast strain EGY48 (MATof trpl his 3 ura3 leu2 : : 6LexAop- 

10 Leu2) was transformed with plasmids pJKlOB (2 /xM, 2LexAop- 
LacZ) , pEG202-GR(2 /xM, LexADBD-F6, -a/fSlll, -Qf/jSV, or -all 
peptides) . The resulting transformed strain was grown 
overnight in media containing galactose as the sole carbon 
source to induce expression of GR . The culture was diluted to 

15 an ODgQo of 0-1 i^i 10 ml of media and deoxycorticosterone., 
dexamethasone , cort icosterone , or j8-estradiol were added to a 
final concentration of 1 /xM . The cultures were incubated at 
30*^C for 3 hours. Preparations of protein were made by lysing 
the yeast by aggitation with glass beads. The cellular debris 

2 0 was removed and the protein concentrated by precipitation with 
50% ammonium sulfate for 30 minutes at 4^C. The protein pellet 
was suspended in storage buffer (100 mM HEPES 50 mM EDTA 4 0% 
glycerol 7 mM 2 -mercaptoethanol , pH8) and protein 
concentrations determined. To determine the interaction of the 

25 peptides with GR, an end point assay for jS-galactosidase 
activity was performed. 10 jug of protein extract was diluted 
into a final volume of 100 //I IxZ buffer and color development 
was initiated by the addition of 80 fxg of ONPG. The reactions 
were stopped by addition of 3 0 [jlI of IM Na2C03 and the time of 

30 development noted. )8-galactosidase activity is expressed as 
1000*OD4Q5/min/mg protein. 
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The invention, as contemplated by applicant (s) , includes 
hut is not limited to the subject matter set forth in the 
appended claims, and presently unclaimed combinations thereof . 
It further includes such subject matter further limited, if not 
5 already such, to that which overcomes one or more f the 
disclosed deficiencies in the prior art. To the extent that 
any claims encroach on subject matter disclosed or suggested 
by the prior art, applicant (s) contemplate the invention (s) 
corresponding to such claims with the encroaching subject 
10 matter excised. 

All references cited anywhere in this specification 
are hereby incorporated by reference, as are any references 
^3 cited by said references . 
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Table A: List of Proteins for Fingerprinting Analysis 



Receptors 
Nuclear receptors 
5 Estrogen Receptor a. 

and iS 



Progesterone 



10 



15 



20 



Androgen 

Glucocorticoid 

mineralocort icoid 

Retinoic acid 

Thyroid 
Vitamin D3 
PPAR (s) 

LXR 

FXR 

BXR 

SXR 



Orphan Nuclear Receptors 
Nurrl 
Norl 



Modulators of Activity 

Estradiol (agon) , 
tamoxifen (antag) , ICI 
182,780 (antag). Raloxifene, 
(antag) , 

Progestins , estrogens 
(agon) , RU486 (antag) , 
ZX98299, (antag) , onapristone 
(antag) 

Dihydroxy testosterone 
(agon) , hydroxyf lutamide 
(antag) 

Cortisone (agon) , 
dexamethasone (agon) 

Aldosterone (agon) , 
spironolactone (antag) 

9 -cis retinoic acid 
(agon) 

Thy ro i d hormone ( agon ) 

Vitamin D3 (agon) 

Eicosinoids (agon) , 
oxidized LDL (agon) 

Oxidized cholesterol 
metabolites (agon) 

Farnesoid metabolites 
(agon) 

3 -aminoethyl benzoate 
(agon) 

Steroids (agon) , 
phytoestrogens (agon) , 
xenobiotics (agon) 
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NGFl-B 
ERRl 
SHP 
HNF-4 

5 Coup-TF II 

EGF (agon) , ATP 

Insulin (agon) , ATP 
PDGF (agon) , ATP 



Tyrosine 



10 



Kinase Receptors 
Epidermal 
growth factor 
Insulin 

Platelet 
derived growth 
factor 



1= 



G-Protein Coupled 
Receptors 

15 jS-adrenergic Isopreterenol (agon) , 

receptor alprenolol (antag) 

Rhodopsin 

Dopamine D2 Dopamine (agon) , 

haloperidol (antag) 

opiod Leu-enkephalin (agon) , 

Na 1 1 r i ndo 1 e ( an tag) 



2 0 Endothelin 

Erythropoietin receptor 
FAS ligand receptor 
Inter leukin receptor 

S igna 1 Transduction 
25 Proteins 

Kinases 

Protein Kinases 
Protein kinase C 

Tyrosine kinase 



Endothelin 1 (agon) , BQ- 
123 (antag) 

Ery thropoie t in 
FAS ligand 

Interferon (agon) IL-6 
(agon) 



diacylglycerol (agon) , 
staurosporine (antag) 
ATP, genistein (antag) 
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Serine kinase ATP 
Threonine kinase ATP 
Nucleotide kinase ATP 
Polynucleotide kinase ATP, DNA, PO4 

5 Phosphatase 

Protein Phosphatase 

Serine /threonine 
Tyrosine 
Nucleotide phosphatase 
10 Acid phosphatase 

Alkaline phosphatase 
pyrophosphatase 



Cell Cycle Regulators 
Cyclin CDK-2 
15 CDC2 
CDC2 5 
p53 

Retinoblastoma 
GTPases 
20 Large G proteins 

Gofs suramin (antag) , 

mastoparin (agon) 
Small G Proteins GAPs (ag) , GEF (antag) 

Rac 

2 5 Rho 

Rab 
Ras 

Proteases 

Endopro tease 

3 0 Exprotease 

Metal loprot ease 
Serine protease 
Cysteine protease 



Nucleases 



35 



Polymerases 
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Ion Channels 

Chaperonins 

Heat shock Proteins 

Viral Proteins 

5 Deaminases 

Nucleases 

Deoxyribonuc lease 
Ribonuclease 
Endonucleases 
10 Exonucleases 

Polymerases 

DNA dependent RNA polymerase 
DNA dependent DNA polymerase 
Telomerase 
15 Primase 

Helicase 

Dehydrogenase 

Aminoacyl tRNA synthetases 

Transferases 

20 Peptidyl transferase 

Transaminase 
Glycosyl transferase 
Ribosyl transferase 
Acetyl transferases 

2 5 Acyltransf erases 



Hydrolases 



Carboxylases 

Isomerases 

Dismutase 
Rotase 

Topo i s otne rase 

Glycosidase 

Endoglycosidase 
Exoglycosidase 

Deaminase 



10 Lipases 

Esterases 
Sulf atases 
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Cellulase 
Lyases 
15 Reductases 
Synthetase 

DNA binding proteins 

RNA binding proteins 

Nuclear receptor coactivators 

20 Ligases 
RNA 
DNA 



Tumor suppressor 
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Adhesion molecule 

Oxygenase 

Peroxidase 

Transporters 
5 Electron transporters 

Protein transporters 
Peptide transport 
Hormone transport 
Serotonin 
10 DOPA 

Nucleic acid transport 

Transcription factors 
Neurotransmitters 
Information carrier/ storage 
15 Antigen recognition protein 

MHC I complex 

MHC II complex 

Antag=antagonist of receptor 
agon=agonist of receptor 
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Table B: Target Tissues 
Circulatory and Lymphatic Systems 
Heart 

Walls 

5 Valves 

Blood Vessels 
Blood Cells 

Erythrocytes . 
Platelets 
10 Leukocytes 
Lymph Nodes 
Lymphatic Vessels 
Spleen 
Thymus 
15 Tonsils 



Respiratory System 
Lungs 

Trachea 
Bronchi 

20 Bronchioles 

Alveoli 
Pleura 
Pharynx 
Larynx 
2 5 Trachea 



Endocrine System 

Pituitary Gland 
Thyroid Gland 
Parathyroid Gland 
3 0 Adrenal Gland 

Adrenal Medulla 
Adrenal Cortex 
Pancreas 

Islets of Langerhans 

3 5 Liver 

Gall Bladder 
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Mammary Glands 
Central Nervous System 
Brain 

Neurons 

5 Glial Cells 

Spinal Cord 
Nerves 

Peripheral Nervous System 
Eye 

10 Retina 

Lens 

Ear 

Eardrum 
Ampullae 

15 Spiral organ of Corti 

Nose 

Olfactory bulbs 
Tongue 

taste buds 

20 Digestive System 

Tongue 

Salivary Gland 

Pharynx 

Esophagus 

2 5 Stomach 

Small Intestine 
Large Intestine 

Urinary System 
Kidney 

3 0 nephrons 

Bladder 

Male Reproductive System 
testes 

prostate gland 
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bulbourethral (Cowper's) glands 
penis 

sperm cells 

Musculoskeletal System 
5 . bones (various) 

bone marrow 
joints (various) 
muscles (various) 
ligaments (various) 

10 Female Reproductive System 

Ovaries 
^^3 Uterus 

Bartholin's Glands 
%j Paraurethral Glands 

;^ 15 Egg Cells 

H Integumentary System 

rJ Skin 
^'Z epidermis 
%3 dermis 

2 0 hypodermis 

sweat glands 

sebaceous glands 

hair 

nails 



237 

Table 1 

Peptides the Bind to the Unliganded (unactivated) 
Estrogen Receptor 

Sequence Phage # 

5 SRWESPLGTWEWSR 4 

SAAPRTISHYLMGG 48 

SSWVRLSDFPWGVSR 1 

SSWDRLSDFPWGVSR 2 

SSWIRLRDLPWGESR 3 

10 SSWVLLRDLPWGSR 31 

SSWVVLRDLPWGSR 29 

SSCKWYEKCSGLWSR 7 

SSGICFFWDGCFESR 35 

SRNLCFFWDDEYCSR 41 

15 HHHRHPAHPHTYGG 47 

Table 2 

.3 Peptides that Bind to the Estradiol Activated 

Receptor 

fil Sequence Phage # 

20 SRAGLLSDLLiEGKSR 1/2 

1% SSRSLLRDLLMVDSR 6 

SSNKLLYNLLKMESR 22 

;2 SSKSLLLNLLSTPSR 23 

^ ' HSFPRESLLVRLLQGG 42 

^25 SRLEMLLRSETDFSR 3 

SRLEELLKWGSVTSR 11 

zi SRLEQLLKEEFSYSR 21 

1^ SRLEQLLRSEPDFSR 27 

£^ SRLEDLLRAPFTTSR 28 

30 SRLESLLRFGQLDSR 29 

%J SSRLLSLLVGDFNSR 19/20 

SRLEELLLGTNRDSR 30 

SRLKELLLLPTDLSR 15 

SRLECLLEGRLNCSR 34 

35 SSKLYCLLDESYC SR 35 

SRLSCLLMGFEDCSR 36 

SSKLIRLLTSDEELSR 37 

SSRLMELLQEGQGWSR 40 

SSNHQSSRLIELLSR 4 

40 SSRLWQLLASTDTSR 16 

SSNSMLWKLLAAPSR 13/14 

SSKTLWRLLEGERSR 17 

SRAGPVLWGLLSESR 32 

SSLTSRDFGSWYASR 5 

45 SSWVRLSDFPWGVSR 24/25 

SSEYCFYDSAHCSR 33 

SRSLLECHLMGNCSR 7 

SSELLRWHLTRDTSR 8 

SRLEYWLKWEPGPSR 12 

50 SRSDSILWRMLSESR 31 

SSKGVLWR ML A E P V S R 38/39 

HSHGPLTLNLLRSSGG 41 

SSAGGGAPAGSTPSR 26 
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other ER binding peptides include 
SSKYSYSRSSEGHSR 
SSYQWETHSDKWRSR 
SSVTKKALTIAKDSR 

5 The latter two are weak binders of ER in presence of estradiol. 
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Table 3 : Phage/Peptide CI 
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Notes to Table 14 : 

Fingerprint analysis of estrogen receptor modulators 
on (A) ER Ckf and (B) ER ^8. Immobilized ER was incubated with 
estradiol (1 /xM) , estriol (1 /xM) , premarin (10 ^M) , 4-OH 
tamoxifen (1 /xM) , nafoxidine (10 juM) , clomiphene (10 jitM) , 
raloxifene (1 /xM) , ICI 182,780 (1 /xM) , IGof-OH estrone (10 /xM) , 
DES (1 /xM) or progesterone (1 /xM) . Phage ELISAs were conducted 
as described . 
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s 


R 


L 


S 


C 


L 


L 


M 


G 


F 


E 


D 


c 


S 


R 




20 










s 


s 


K 


L 


I 


R 


L 


L 


T 


S 


D 


E 


E 


L 


s 


R 


^~ 












s 


s 


R 


L 


M 


E 


L 


L 


Q 


E 


G 


Q 


G 


W 


s 


R 


f II 


S 


S 


N 


H 


Q 


s 


s 


R 


L 


I 


E 


L 


L 


S 


R 


























s 


s 


R 


L 


W 


Q 


Ii 


L 


A 


S 


T 


D 


T 


s 


R 




1; y 












s 


s 


K 


L 


W 


Q 


L 


L 


S 


s 


P 


I 


D 


s 


R 






25 












s 


R 


L 


V 


A 


L 


L 


K 


s 


P 


W 


S 


V 


S 


R 










S 


S 


N 


s 


M 


L 


w 


K 


L 


L 


A 


A 


P 


S 


R 








El: 










S 


S 


K 


T 


L 


w 


R 


L 


L 


E 


G 


E 


R 


S 


R 












S 


R 


A 


G 


P 


V 


L 


w 


G 


L 


L 


S 


E 


S 


R 


















S 


R 


S 


P 


I 


L 


T 


H 


L 


L 


S 


L 


G 


S 


R 










30 






S 


S 


T 


G 


I 


L 


w 


K 


L 


L 


T 


A 


E 


S 


R 
















s 


S 


H 


G 


I 


L 


w 


R 


L 


L 


S 


E 


G 


S 


R 









.5 B 







K 


L 


V 


Q 


L 


L 


T 


T 


T 


A 


E 






I 


L 


H 


R 


L 


L 


Q 


E 


G 


S 


P 


35 


SRCla 


L 


L 


R 


Y 


L 


L 


D 


K 


D 


E 


K 






L 


L 


Q 


Q 


L 


L 


T 


E 










CBP 


Q 


L 


S 


E 


L 


L 


R 


G 


G 


S 


G 






Q 


L 


V 


L 


L 


L 


H 


A 


H 


K 


C 






Y 


L 


E 


G 


L 


L 


M 


H 


Q 


A 


A 


40 




L 


L 


A 


S 


L 


L 


Q 


S 


E 


S 


S 






H 


L 


K 


T 


L 


L 


K 


K 


S 


K 


V 




RIP140 


Q 


L 


A 


L 


L 


L 


S 


S 


E 


A 


H 






L 


L 


L 


H 


L 


L 


K 


S 


Q 


T 


I 






L 


L 


Q 


L 


L 


L 


G 


H 


K 


N 


E 


45 




V 


L 


Q 


L 


L 


L 


G 


N 


P 


K 


G 






L 


L 


S 


R 


L 


L 


R 


Q 


N 


Q 


D 






V 


L 


K 


Q 


L 


L 


L 


S 


E 


N 


C 



SRCla = human steroid receptor coactivator la, 
CBP = mouse cAMP- responsive element (CREB) -binding 
50 protien, 

RIP 140 = human RIP140 
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Table 101 



Class I 

ER4 SSNHQSRLIELLSR 
D2 GSEPKSRLLELLSAPVTDV 
5 D3 0 HPTHSSRLWELLMEATPTM 

D 1 1 VE SGS SRLMQLLMANDLLT 

Class II 

D4 7 HVYQHPLLLSLLSSEHESG 
C33 HVEMHPLLMGLLMESQWGA 
10 D14 QEAHGPLLWNLLSRSDTDW 

Class III 

F6 GHEPLTLLERLLMDDKQAV 

D2 2 LPYEGSLLLKLLRAPVEEV 

D4 8 SGWENS ILYSLLSDRVSLD 

15 D43 AHGESSLLAWLLSGEYSSA 

Dl 7 GVFCDS ILCQLLAHDNARL 

D4 1 HHNGHS ILYGLLAGSDAPS 

D2 6 LGERASLLDMLLRQENPAW 

D4 0 SGWNESTLYRLLQADAFDV 

2 0 Dl 5 PSGGSSVLEYLLTHDTS IL 

F4 PVGEPGLLWRLLSAPVERE 

Misc . 



Dl 0 WEEHSQMLLHLLDTGEAVW6 



ER/Jsp. #2 93 SSIKDFPNLISLLSR 

25 GRI P - 1 NRl DSKGQTKLLQLLTTKSDQM 

NR2 LKEKHKILHQLLQDSSSPV 

NR3 KKKENALLRYLLDKDDTKD 



SRC-1 

30 



NRl 
NR2 
NR3 



YSQTSHKLVKLLTTTAEQQ 
LTARHK I LHRLLQEGS PSD 
ESKDHQLLRYLLDKDEKDL 
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Table 202A; Gial GDP/GTP7S- Independent Phage (I-Peptides) 



ID 


Seauence /Mot i f Al icrned 


Lia 


Library 


99 


SRAHLLTWSEFLDSHTK 


BUF 


E 


103 


SSGELITWYEFLGDLNP 


BUF 




107 


SRGELTTWYEFLSHGRP 


BUF 


E 


361 


DELTWWEFISD 


GTP 


K 


"3 Q Q "2 Q T 
0 0 0 , 0 ^ ± 


V 1 W X LJr Lji*lj1iLJ l in. 


Kj 1 ir 




45 


GLMTWREFLQE 


BUF 


R 


397, 401, 412 


NLMTWYEYLADGERL 


GTP 


Y 


15r2, 301, 394 


ADRLWTWQEFLY 


BUF 


PHD 12 


380, 381, 140 


KTYSLYEFLEL 


GTP 


N 


16 


SSQLLTLHEFLNS 


BUF 


H 


360 


SSRGEYWWEFLGYSR 






101 


S S ADG I FWWE YARE AGE 


BUF 




375, 123 , 125, 247 


LGRGTTDMPPWAWWS 


GTP 




331, 334 


NYTERPWVWYH 


GDP 




37 


SSLYSMEPWKWYT 


BUF 




387 


KWWESDWFVNFG 


GTP 




386 


EEGMDWFMRWE 


GTP 
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1 Peptide Consensus I-l 

(the # of positions is relative, #1 need not be the N- terminal 
of the peptide) 

1: strongly hydrophobic, aliphatic Aas (L.I.V.M) favored, but 
5 note that T occurs 

2 : T 

3 : W 

4: aromatic (incl. Y, W) , charged basic (e.g., R) , nonpolar 
hydrophilic (e.g., S, Q) all known. May be fairly tolerant of 
10 substitution 

5: charged acidic (D, E) 
6: aromatic (incl, F, Y) 

7: mostly L, but I and A also accepted. Possible preference 
for hydrophobic, esp. aliphatic 

15 Consensus 1-2 

1 aromatic (incl. Y, F) 

2 W 

3 W 

4 E 

20 5 aromatic (incl. F, Y) \ 
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Table 202B; Gial GTP-Specific Phage 

(T-Peptides) 

370,377,378 SVLSSSEMCFGWACY GTP M 

244 SEMCFGWACY GDP PARO 

5 3 6 6, Gl 2 FNEVCLGWQCY GTP K 

G3 3 , G3 4 SSNARPCQGWHCYLPSQSR 

3 53 WDGGVWMGPAS GTP K 

408 MGDSVLPYGGVWLGP GTP Y 

G2 2 , G2 5 SRYGGVWLGPEGNSR 

10 Gil, G2 6-29 SSWDGGVWWGQYGSR 

G9 , GIG SSNLDGCFTSGGVWSGCSR 

3 82 LGYDINGVWIG GTP N 

384 ICDIIPWEESCSR GTP P 

413 ACGPAI CPWDFMPQL GTP PARO 

15 Note: clone 244, which was identified in a screen for peptide 
which bound GDP:G-alpha, is suspected to having increased the 
affinity of the G-alpha for GTP through a conformational 
change . 
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T-Peptide Consensus T-1 

1 charged AA (esp. E, R) 

2 M, V, P, otherwise uncertain 

3 C 
5 4 X 

5 G 

6 W 

Consensus T-2 

1 G 
10 2 G 

3 V 

4 w 

Consensus T-3 

1 I 
15 2 X 

3 P 

4 W 

5 D, E 
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Table 202C: G±otl GDP-Specific Phage 
(D-Peptides) 

G4 



SRGPQLTWQEFLTGAASSR 





314 




NWTWWEFLGP 


GDP 




5 


73 




SREFVTWKEFLGS 


BUF 


K 




343 




SQLTWREFLFG 


GDP 


R 




217 




SSHLMTWHEFISD 


GDP 


H 




93 




SRDGFETWAEFLGASGS 


BUF 






62 




SRLTWSEYLSEIDP 


BUF 


CWL 


10 


193 




SRTVTWVDFLKET 


GDP 


D 




324 




MSWYEFMTEESM 


GDP 


CWI 




400 




AKHDLSWYEFLQLP I 


GTP 


V 




281 




SRLSWWEFLGASDCGTC 


GDP X14C<W 




359, 


161 


DLLSLKEFLAT 


GTP 


K 


15 


176 




SSPNLLTLEEFLS 


GDP 


L 




380, 


381 , 140 


KTYSLYEFLEL 


GTP 


N 




409, 


24r2 


MSNRYT I YE FLNLHS 


GTP 


Y 




320 




LHWWEVLAEK 


GDP 


CWL 




230 




S S PQ PLLHWWEMMTE P P 


GDP 


KNK 


20 


213 




SRAGESVHWWEVL 


GDP 


H 




266 




RAGPSEHWWEYIATL 


GDP 


N 




237 




EM I S WHQ YLL S I ENN 


GDP 




126, 


128,133,242,248 


SSLRWDEFLMELGGGVA 


BUF 


M 




379 




VPWWVWLAEGD 


GTP 


N 


25 


196 




SREIYWWDWLTDT 


GDP 


D 




117 




FGSNMLDLPTFLDWL 


BUF 


PAR( 




92 




SRITFWELMLEGG 


BUF 


L 




179 




S RT P YE WLG YWGA 


GDP 


L 




289 




YDMCTWLE FLDGGE C 


GDP 


X14( 


30 


265 




SPLCTWAEYLMEPSC 


GDP 


N 




273 




TQWCTWAEFLSSTDC 


GDP 


M 




272, 


282 , 6R2 


SSDGCTWQEFLAGHGPC 


GDP 


N 



PARC 



35 



337, 

268 

330 

329 

280 

319 

115 



339 



PFNNPPWMWWS 
S S PT VHENL P P WLWWS P 
LIHVPPWAWYD 
GFDVPPWYWDF 
YSQVFGDAPVWAWYSSR 
WTPSDWQWWRSK 
SSHWSSDSIFPGFWYSG 



GDP 
GDP 
GDP 
GDP 
GDP 
GDP 
BUF 



P 
N 
P 
P 

X14CW 

CWL 

PARO 



40 



197 
347 



SRGGVDLDIGNSA 
EGEDVRTRIAN 



GDP 
GDP 



D 

R 
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D-peptide Consensus D-1 

1 small and/or hydrophilic Aas , esp T, S, H, P, E, G, 

2 hydrophobic, esp W, L, I 

3 X 

5 4 E and W (a rather exotic pair, so perhaps a tolerant 
position) 

5 aromatics (F,Y,W) and also aliphatic hydrophobic (at least 
V, M) 

6 aliphatic hydrophobic 9at least L, I, M) but not exclusively 

10 Consensus D-2 

1 W 

2 W 

3 X 

4 W 

15 Consensus D-3 

1 W 

2 X 

3 E 

4 aromatic (at least F, Y) 
20 5 L 
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